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FIGURE 1 

CCAATCGCCCGGTGCGGTGGTaCAGGGTCTCGGGCTAGTCASSQCGTCCCCGTCTCGGAGAC 
TGCAGACTAAACCAGTCATTACTTGTTTCAAGAGCGTTCTGCTAATCTACACTTTTATTTTC 
TGGATCACTGGCGTTATCCTTCTTGCAGTTGGCATTTGGGGCAAGGTGAGCCTGGAGAATTA 
CTTTTCTCTTTTAAATGAGAAGGCCACCAATGTCCCCTTCGTGCTCATTGCTACTGGTACCG 
TCATTATTCTTTTGGGCACCTTTGGTTGTTTT'GCTACCTGCCGAGCTTCTGCATGGATGCTA 
AAACTGTATGCAATGTTTCTGACTCTCGTTTTTTTGGTCGAACTGGTCGCTGCCATCGTAGG 
ATTTGTTTTCAGACATGAGATT7UVGAACAGCTTTAAGAATAATTATGAGAAGGCTTTGAAGC 
AGTATAACTCTACAGGAGATTATAGAAGCCATGCAGTAGACAAGATCCAAAATACGTTGCAT 
TGTTGTGGTGTCACCGATTATAGAGATTGGACAGATACTAATTATTACTCAGAAAAAGGATT 
T C CTAAGAGTTG CTGT AAACTTGAAGATTGTACTCCACAGAGAGATGC AGACAAAGTAAAC A 
ATGAAGGTTGTTTTATAAAGGTGATGACCATTATAGAGTCAGAAATGGGAGTCGTTGCAGGA 
ATTTCCTTTGGAGTTGCTTGCTTCCAACTGATTGGAATCTTTCTCGCCTACTGCCWCTCTCG 
TGCCATAACAAATAACCAGTATGAGATAGTG TAA CCCAATGTATCTGTGGGCCTATTCCTCT 
CTACCTTTAAGGACATTTAGGGTCCCCCCTGTGAATTAGAAAGTTGCTTGGCTGGAGAACTG 
ACAACACTACTTACTGATAGACCAAAAAACTACACC AGTAGGTTGATT CAAT CAAG ATGTAT 
GTAGACCTAAAACTACACCAATAGGCTGATTCAATCAAGATCCGTGCTCGCAGTGGGCTGAT 
TCAATCAAGATGTATGTTTGCTATGTTCTAAGTCCACCTTCTATCCCATTCATGTTAGATCG 
TTGAAACCCTGTATCCCTCTGAAACACTGGAAGAGCTAGTAAATTGTAAATGAAGT 
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24. An isolated PRO polypeptide lacking its associated signal peptide. 

25. An isolated polypeptide having at least about 80% amino acid sequence identity to an 
extracellular domain of of PRO polypeptide. 

26. An isolated polypeptide having at least about 80% amino acid sequence identity to a PRO 
polypeptide lacking its associated signal peptide. 

27. An isolated nucleic acid encoding the polypeptide of any one of Claims 23 to 26. 
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22. An isolated nucleic acid which has at least 80% sequence identity to the full-length coding 
sequence of a nucleotide sequence selected from the group consisting of that shown in Figure 1 (SEQ ID NO:3), 
Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 7 (SEQ ID NO:9), Figure 9 (SEQ ID NO: 11), 
Figure 11 (SEQ ID NO: 16), Figure 13 (SEQ ID NO:21), Figure 15 (SEQ ID NO:23), Figure 17 (SEQ ID 
NO:28), Figure 19 (SEQ ID NO:30), Figure 21 (SEQ ID NO:32), Figure 23 (SEQ ID NO:40), Figure 25 (SEQ 
5 ID NO:42), Figure 27 (SEQ ID NO:49), Figure 29 (SEQ ID NO:51), Figure 31 (SEQ ID NO:53), Figure 33 
(SEQ ID NO:55), Figure 35 (SEQ ID NO:57), Figure 37 (SEQ ID NO:62), Figure 39 (SEQ ID NO:67), Figure 
41 (SEQ ID NO:69), Figure 43 (SEQ ID NO:71), Figure 45 (SEQ ID NO:76), Figure 47 (SEQ ID NO:78), 
Figure 49 (SEQ ID NO:83), Figure 51 (SEQ ID NO:85), Figure 53 (SEQ ID NO:87), Figure 55 (SEQ ID 
NO:94), Figure 57 (SEQ ID NO:99), Figure 59 (SEQ ID NO:101), Figure 61 (SEQ ID NO:103), Figure 63 

10 (SEQ ID NO: 1 10), Figure 65 (SEQ ID NO: 1 15), Figure 67 (SEQ ID NO: 1 17), Figure 69 (SEQ ID NO: 122), 
Figure 71 (SEQ ID NO:127), Figure 73 (SEQ ID NO:129), Figure 75 (SEQ ID NO:131), Figure 77 (SEQ ID 
NO: 133), Figure 79 (SEQ ID NO: 135), Figure 81 (SEQ ID NO: 137), Figure 83 (SEQ ID NO: 139), Figure 85 
(SEQ ID NO:141), Figure 87 (SEQ ID NO:143), Figure 89 (SEQ ID NO:145), Figure 91 (SEQ ID NO:147), 
Figure 93 (SEQ ID NO: 152), Figure 95 (SEQ ID NO: 157), Figure 97 (SEQ ID NO: 159), Figure 99 (SEQ ID 

15 NO: 161), Figure 101 (SEQ ID NO: 169), Figure 103 (SEQ ID NO: 179), Figure 105 (SEQ ID NO: 188), Figure 
107 (SEQ ID NO:193), Figure 109 (SEQ ID NO:195), Figure 111 (SEQ ID NO:197), Figure 113 (SEQ ID 
NO:202), Figure 115 (SEQ ID NO:209), Figure 117 (SEQ ID NO:211), Figure 119 (SEQ ID NO:213), Figure 
121 (SEQ ID NO:215), Figure 123 (SEQ ID NO:217), Figure 125 (SEQ ID NO:219), Figure 127 (SEQ ID 
NO:224), Figure 129 (SEQ ID NO:226), Figure 131 (SEQ ID NO:228), Figure 133 (SEQ ID NO:233), Figure 

20 135 (SEQ ID NO:235), Figure 137 (SEQ ID NO:242), Figure 139 (SEQ ID NO:247), Figure 141 (SEQ ID 
NO:252), Figure 143 (SEQ ID NO:259), Figure 145 (SEQ ID NO:264), Figure 147 (SEQ ID NO:266), Figure 
149 (SEQ ID NO:268), Figure 151 (SEQ ID NO:270), Figure 153 (SEQ ID NO:272), Figure 155 (SEQ ID 
NO:274), Figure 157 (SEQ ID NO:276), Figure 159 (SEQ ID NO:281), Figure 161 (SEQ ID NO:286), Figure 
163 (SEQ ID NO:291), Figure 165 (SEQ ID NO:296), Figure 167 (SEQ ID NO:301), Figure 169 (SEQ ID 

25 NO:303), Figure 171 (SEQ ID NO:305), Figure 173 (SEQ ID NO:307), Figure 175 (SEQ ID NO:309), Figure 
177 (SEQ ID NO:314), Figure 179 (SEQ ID NO:316), Figure 181 (SEQ ID NO:32I), Figure 183 (SEQ ID 
NO:323), Figure 185 (SEQ ID NO:325), Figure 187 (SEQ ID NO:327), Figure 189 (SEQ ID NO:329), Figure 
191 (SEQ ID NO:331), Figure 193 (SEQ ID NO:333), Figure 195 (SEQ ID NO:335), Figure 197 (SEQ ID 
NO:337), Figure 199 (SEQ ID NO:339), Figure 201 (SEQ ID NO:346), Figure 203 (SEQ ID NO:351), Figure 

30 205 (SEQ ID NO:353), Figure 207 (SEQ ID NO:355), Figure 209 (SEQ ID NO:357), Figure 211 (SEQ ID 
NO:363), Figure 213 (SEQ ID NO:365), Figure 215 (SEQ ID NO:371), Figure 217 (SEQ ID NO:373), Figure 
219 (SEQ ID NO:375), Figure 221 (SEQ ID NO:377), Figure 223 (SEQ ID NO:382), Figure 225 (SEQ ID 
NO:384), Figure 227 (SEQ ID NO:389), Figure 229 (SEQ ID NO:394), Figure 231 (SEQ ID NO:396), Figure 
233 (SEQ ID NO:401), Figure 235 (SEQ ID NO:405), Figure 237 (SEQ ID NO:409), Figure 239 (SEQ ID 

35 NO:414), Figure 241 (SEQ ID NO:422), Figure 242 (SEQ ID NO:428) and Figure 245 (SEQ ID NO:430). 

23. An isolated extracellular domain of a PRO polypeptide. 
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21. An isolated nucleic acid which has at least 80% sequence identity to a nucleic acid sequence 
selected from the group consisting of that shown in Figure 1 (SEQ ID NO:3), Figure 3 (SEQ ID NO:5), Figure 
5 (SEQ ID NO:7), Figure 7 (SEQ ID NO:9), Figure 9 (SEQ ID NO: 11), Figure 11 (SEQ ID NO: 16), Figure 
13 (SEQ ID NO:21), Figure 15 (SEQ ID NO:23), Figure 17 (SEQ ID NO:28), Figure 19 (SEQ ID NO:30), 
Figure 21 (SEQ ID NO:32), Figure 23 (SEQ ID NO:40), Figure 25 (SEQ ID NO:42), Figure 27 (SEQ ID 
5 NO:49), Figure 29 (SEQ ID NO:51), Figure 31 (SEQ ID NO:53), Figure 33 (SEQ ID NO:55), Figure 35 (SEQ 
ID NO:57), Figure 37 (SEQ ID NO:62). Figure 39 (SEQ ID NO:67), Figure 41 (SEQ ID NO:69), Figure 43 
(SEQ ID NO:71), Figure 45 (SEQ ID NO:76), Figure 47 (SEQ ID NO:78), Figure 49 (SEQ ID NO:83), Figure 
51 (SEQ ID NO:85), Figure 53 (SEQ ID NO:87), Figure 55 (SEQ ID NO:94), Figure 57 (SEQ ID NO:99). 
Figure 59 (SEQ ID NO: 101), Figure 61 (SEQ ID NO: 103), Figure 63 (SEQ ID NO: 110), Figure 65 (SEQ ID 

10 NO:l 15), Figure 67 (SEQ ID NO:l 17), Figure 69 (SEQ ID NO: 122), Figure 71 (SEQ ID NO: 127), Figure 73 
(SEQ ID NO:129), Figure 75 (SEQ ID NO:131), Figure 77 (SEQ ID NO: 133), Figure 79 (SEQ ID NO:135), 
Figure 81 (SEQ ID N0:137), Figure 83 (SEQ ID NO:139), Figure 85 (SEQ ID NO:141), Figure 87 (SEQ ID 
NO: 143), Figure 89 (SEQ ID NO:145), Figure 91 (SEQ ID NO:147), Figure 93 (SEQ ID NO:152), Figure 95 
(SEQ ID NO:l57), Figure 97 (SEQ ID NO:159), Figure 99 (SEQ ID NO:161), Figure 101 (SEQ ID NO:169), 

15 Figure 103 (SEQ ID NO:179), Figure 105 (SEQ ID NO:188), Figure 107 (SEQ ID NO:193), Figure 109 (SEQ 
ID NO: 195), Figure 111 (SEQ ID NO: 197), Figure 113 (SEQ ID NO:202), Figure 115 (SEQ ID NO:209), 
Figure 1 17 (SEQ ID NO:21 1), Figure 1 19 (SEQ ID NO:213), Figure 121 (SEQ ID NO:215), Figure 123 (SEQ 
ID NO:217), Figure 125 (SEQ ID NO:219), Figure 127 (SEQ ID NO:224), Figure 129 (SEQ ID NO:226), 
Figure 131 (SEQ ID NO:228), Figure 133 (SEQ ID NO:233), Figure 135 (SEQ ID NO:235), Figure 137 (SEQ 

20 ID NO:242), Figure 139 (SEQ ID NO:247), Figure 141 (SEQ ID NO:252), Figure 143 (SEQ ID NO:259), 
Figure 145 (SEQ ID NO:264), Figure 147 (SEQ ID NO:266), Figure 149 (SEQ ID NO:268), Figure 151 (SEQ 
ID NO:270), Figure 153 (SEQ ID NO:272), Figure 155 (SEQ ID NO:274), Figure 157 (SEQ ID NO:276), 
Figure 159 (SEQ ID NO:281), Figure 161 (SEQ ID NO:286), Figure 163 (SEQ ID NO:291), Figure 165 (SEQ 
ID NO:296), Figure 167 (SEQ ID NO:301), Figure 169 (SEQ ID NO:303), Figure 171 (SEQ ID NO:305), 

25 Figure 173 (SEQ ID NO:307) f Figure 175 (SEQ ID NO:309), Figure 177 (SEQ ID NO:314), Figure 179 (SEQ 
ID NO:316), Figure 181 (SEQ ID NO:321), Figure 183 (SEQ ID NO:323), Figure 185 (SEQ ID NO:325), 
Figure 187 (SEQ ID NO:327), Figure 189 (SEQ ID NO:329), Figure 191 (SEQ ID NO:331), Figure 193 (SEQ 
ID NO:333), Figure 195 (SEQ ID NO:335), Figure 197 (SEQ ID NO:337), Figure 199 (SEQ ID NO:339), 
Figure 201 (SEQ ID NO:346), Figure 203 (SEQ ID NO:351), Figure 205 (SEQ ID NO:353), Figure 207 (SEQ 

30 ID NO:355), Figure 209 (SEQ ID NO:357), Figure 211 (SEQ ID NO:363), Figure 213 (SEQ ID NO:365), 
Figure 215 (SEQ ID NO:37I), Figure 217 (SEQ ID NO:373), Figure 219 (SEQ ID NO:375), Figure 221 (SEQ 
ID NO:377), Figure 223 (SEQ ID NO:382), Figure 225 (SEQ ID NO:384), Figure 227 (SEQ ID NO:389), 
Figure 229 (SEQ ID NO:394), Figure 231 (SEQ ID NO:396). Figure 233 (SEQ ID NO:401), Figure 235 (SEQ 
ID NO:405), Figure 237 (SEQ ID NO:409), Figure 239 (SEQ ID NO:414), Figure 241 (SEQ ID NO:422), 

35 Figure 242 (SEQ ID NO:428) and Figure 245 (SEQ ID NO:430). 
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FIGURE 2 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA19902 
xsubunit 1 of 1, 245 aa, 1 stop, 1 unknown 
><MW: -1, pi: 8.36, NX(S/T): 1 

MASPSRRLQTKPVITCFKSVLLIYTFIFWITGVILLAVGIWGKVSLENYFSLLNEKATNVPF 
VLIATGTVIILLGTFGCFATCRASAWMLKLYAMFLTLVFLVELVAAIVGFVFRHEIKNSFKN 
NYEKALKQYNSTGDYRSHAVDKIQNTLHCCGVTDYRDWTDTNYYSEKGFPKSCCKLEDCTPQ 
RDADKVNNEGCFIKVMTIIESEMGWAGISFGVACFQLIGIFLAYCXSRAITNNQYEIV 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

Transmembrane domains: 

amino acids 19-42, 61-83, 92-114, 209-230, 

N-glycosylation site. 

amino acids 134-138 

Tyrosine kinase phosphorylation site. 

amino acids 160-168, 160-169 

N-myristoylation site. 

amino acids 75-81, 78-84, 210-216, 214-220, 226-232 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 69-80, 211-222 
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FIGURE 3 



CCCACGCGTCCGGCGCCGTGGCCTCGCGTCCATCTTTGCCGTTCTCTCGGACCTGTCACAAA 

GGAGTCGCGCCGCCGCCGCCGCCCCCTCCCTCCGGTGGGCCCGGGAGGTAGAGAAAGTCAGT 

GCCACAGCCCGACCGCGCTGCTCTGAGCCCTGGGCACGCGGAACGGGAGGGAGTCTGAGGGT 

TGGGGACGTCTGTGAGGGAGGGGAACAGCCGCTCGAGCCTGGGGCGGGCGGACCGGACTGGG 

GCCGGGGTAGGCTCTGGAAAGGGCCCGGGAGAGAGGTGGCGTTGGTCAGAACCTGAGAAACA 

GCCGAGAGGTTTTCCACCGAGGCCCGCGCTTGAGGGATCTGAAGAGGTTCCTAGAAGAGGGT 

GTTCCCTCTTTCGGGGGTCCTCACCAGAAGAGGTTCTTGGGGGTCGCCCTTCTGAGGAGGCT 

GCGGCTAACAGGGCCCAGAACTGCCATTGGATGTCCAGAATCCCCTGTAGTTGATAATGTTG 

GGAATAAGCTCTGCAACTTTCTTTGGCATTCAGTTGTTAAAAACAAATAGGATGCAAATTCC 

TCAACTCCAGGTTATGAAAACAGTACTTGGAAAACTGAAAACTACCTAAATGATCGTCTTTG 

GTTGGGCCGTGTTCTTAGCGAGCAGAAGCCTTGGCCAGGGTCTGTTGTTGACTCTCGAAGAG 

CACATAGCCCACTTCCTAGGGACTGGAGGTGCCGCTACTACCATGGGTAATTCCTGTATCTG 

CCGAGATGACAGTGGAACAGATGACAGTGTTGACACCCAACAGCAACAGGCCGAGAACAGTG 

CAGTACCCACTGCTGACACAAGGAGCCAACCACGGGACCCTGTTCGGCCACCAAGGAGGGGC 

CGAGGACCTCATGAGCCAAGGAGAAAGAAACAAAATGTGGATGGGCTAGTGTTGGACACACT 

GGCAGTAATACGGACTCTTGTAGATAAG TAA GTATCTGACTCACGGTCACCTCCAGTGGAAT 

GAAAAGTGTTCTGCCCGGAACCATGACTTTAGGACTCCTTCAGTTCCTTTAGGACATACTCG 

CCAAGCCTTGTGCTCACAGGGCAAAGGAGAATATTTTAATGCTCCGCTGATGGCAGAGTAAA 

TGATAAGATTTGATGTTTTTGCTTGCTGTCATCTACTTTGTCTGGAAATGTCTAAATGTTTC 

TGTAGCAGAAAACACGATAAAGCTATGATCTTTATTAGAG 
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FIGURE 4 

MIVFGWAVFLASRSLGQGLLLTLEEHIAHFLGTGGAATTMGNSCICRDDSGTDDSVDTQQQQ 
AENSAVPTADTRSQPRDPVRPPRRGRGPHEPRRKKQNVDGLVLDTLAVIRTLVDKO 

Signal peptide: 

amino acids 1-16 

Casein kinase II phosphorylation site. 

amino acids 22-26, 50-54, 113-117 

N-myristoylation site. 

amino acids 18-24, 32-38, 34-40, 35-41, 51-57 
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FIGURE 5 

GGCACGAGGCGCTGTCCACCCGGGGGCGTGGGAGTGAGGTACCAGATTCAGCCCATTTGGCC 
CCGACGCCTCTGTTCTCGGAATCCGGGTGCTGCGGATTGAGGTCCCGGTTCCTAACGGACTG 
CAAGATGGAGGAAGGCGGGAACCTAGGAGGCCTGATTAAGATGGTCCATCTACTGGTCTTGT 
CAGGTGCCTGGGGCATGCAAATGTGGGTGACCTTCGTCTCAGGCTTCCTGCTTTTCCGAAGC 
CTTCCCCGACATACCTTCGGACTAGTGCAGAGCAAACTCTTCCCCTTCTACTTCCACATCTC 
CATGGGCTGTGCCTTCATCAACCTCTGCATCTTGGCTTCACAGCATGCTTGGGCTCAGCTCA 
CATTCTGGGAGGCCAGCCAGCTTTACCTGCTGTTCCTGAGCCTTACGCTGGCCACTGTCAAC 
GCCCGCTGGCTGGAACCCCGCACCACAGCTGCCATGTGGGCCCTGCAAACCGTGGAGAAGGA 
GCGAGGCCTGGGTGGGGAGGTACCAGGCAGCCACCAGGGTCCCGATCCCTACCGCCAGCTGC 
GAGAGAAGGACCCCAAGTACAGTGCTCTCCGCCAGAATTTCTTCCGCTACCATGGGCTGTCC 
TCTCTTTGCAATCTGGGCTGCGTCCTGAGCAATGGGCTCTGTCTCGCTGGCCTTGCCCTGGA 
AATAAGGAGCCTCTAGCATGGGCCCTGCATGCTAATAAATGCTTCTTCAGAAATGAAAAAAA 
AAAAAAAAAAAA 
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FIGURE 6 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA56107 
<subunit 1 of l, 231 aa, 1 stop 
<NX(S/T) : 0 

MEEGGNLGGLIKMVHLLVLSGAWGMQMWVTFVSGFLLFRSLPRHTFGLVQSKLFPFYFHISM 
GCAFINLCILASQHAWAQLTFWEASQLYLLFLSLTLATVNARWLEPRTTAAMWALQTVEKER 
GLGGEVPGSHQGPDPYRQLREKDPKYSALRQNFFRYHGLSSLO^ 

Signal peptide: 

amino acids 1-24 

Transmembrane domain: 

amino acids 86-103, 60-75 

Casein kinase II phosphorylation site, 

amino acids 82-86 

Tyrosine kinase phosphorylation site. 

amino acids 144-151 

N-myristoylation site. 

amino acids 4-10, 5-11, 47-53, 170-176, 176-182 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 54-65 

G-protein coupled receptors proteins. 

amino acids 44-85 
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FIGURE 7 

AATTCAGATTTTAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCTT 

CTTGTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCTTTTGGGT 

GCTAGGCCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAG 

ACATCACTGATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCC 

AGAACTTTTGATAAAAAGGG ATTTCATGTAAT CG CTGCCTGTCTGACTGAAT CAGGATCAAC 

AGCTTTAAAGGCAGAAACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAG 

AGAATGTCAAGAGGACTGCCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGT 

CTGATCAATAATGCTGGTGTTCCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGA 

CTACAGAGAACCTATTGAAGTGAACCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTC 

CTTTGGTCAAGAAAGCTCAAGGGAGAGTTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCA 

ATCGTTGGAGGGGGCTATACTCCATCCAAATATGCAGTGGAAGGTTTCAATGACAGCTTAAG 

ACGGGACATGAAAGCTTTTGGTGTGCACGTCTCATGCATTGAACCAGGATTGTTCAAAACAA 

ACTTGGCAGATCCAGTAAAGGTAATTGAAAAAAAACTCGCCATTTGGGAGCAGCTGTCTCCA 

GACATCAAACAACAATATGGAGAAGGTTACATTGAAAAAAGTCTAGACAAACTGAAAGGCAA 

TAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTGCATGGACCACGCTCTAACAA 

GTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAATTTTCTGGATACCTCTG 

TCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAAGCAGAGCTGGCTAA 

TCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGAAATTGGCCGAT 

TTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACTCATTTAGA 

TCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGTCCCTG 

CTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCTGT 

ATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATG 

ATCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTT 

AAGTATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56406 
xsubunit 1 of 1, 319 aa, 1 stop 
><MW: 35227, pi: 8.97, NX(S/T): 3 

MLFWVLGLLI LCGFLWTRKGKLKI ED ITDKY I F I TGCDSGFGNLAARTFDKKGFHVI AACLT 
ESGSTALKAETSERLRTVLLDVTDPENVKRTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDW 
LTLEDYREPIEVNLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLALVGGGYTPSKYAVEGF 
NDSLRRDMKAFGVHVSCIEPGLFKTNLADPVKVIEKKLAIWEQLSPDIKQQYGEGYIEKSLD 
KLKGNKSYVNMDLSPWECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQK 
AELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 13 6-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 9 

GCGGGCTGTTGACGGCGCTGCGATGGCTGCCTGCGAGGGCAGGAGAAGCGGAGCTCTCGGTT 

CCTCTCAGTCGGACTTCCTGACGCCGCCAGTGGGCGGGGCCCCTTGGGCCGTCGCCACCACT 

GTAGTCATGTACCCACCGCCGCCGCCGCCGCCTCATCGGGACTTCATCTCGGTGACGCTGAG 

CTTTGGCGAGAGCTATGACAACAG CAAGAGTTGG CGG CGGCG CTCGTG CTGGAGG AAATGGA 

AGCAACTGTCGAGATTGCAGCGGAATATGATTCTCTTCCTCCTTGCCTTTCTGCTTTTCTGT 

GGACTCCTCTTCTACATCAACTTGGCTGACCATTGGAAAGCTCTGGCTTTCAGGCTAGAGGA 

AGAGCAGAAGATGAGGCCAGAAATTGCTGGGTTAAAACCAGCAAATCCACCCGTCTTACCAG 

CTCCTCAGAAGGCGGACACCGACCCTGAGAACTTACCTGAGATTTCGTCACAGAAGACACAA 

AGACACATCCAGCGGGGACCACCTCACCTGCAGATTAGACCCCCAAGCCAAGACCTGAAGGA 

TGGGACCCAGGAGGAGGCCACAAAAAGGCAAGAAGCCCCTGTGGATCCCCGCCCGGAAGGAG 

AT CCGCAGAGGACAGT CATCAG CTGGAGGGGAGCGGTGATCGAGCCTGAGCAGGGCAC CGAG 

CTCCCTTCAAGAAGAGCAGAAGTGCCCACCAAGCCTCCCCTGCCACCGGCCAGGACACAGGG 

CACACCAGTGCATCTGAACTATCGCCAGAAGGGCGTGATTGACGTCTTCCTGCATGCATGGA 

AAGG ATAC CG CAAGTTTG CATGGGGCCATGACGAG CTGAAGCCTGTGT CC AGGT CC TTCAGT 

GAGTGGTTTGGCCTCGGTCTCACACTGATCGACGCGCTGGACACCATGTGGATCTTGGGTCT 

GAGGAAAGAATTTGAGGAAGCCAGGAAGTGGGTGTCGAAGAAGTTACACTTTGAAAAGGACG 

TGGACGTCAACCTGTTTGAGAGCACGATCCGCATCCTGGGGGGGCTCCTGAGTGCCTACCAC 

CTGTCTGGGGACAGCCTCTTCCTGAGGAAAGCTGAGGATTTTGGAAATCGGCTAATGCCTGC 

CTTCAGAACACCATCCAAGATTCCTTACTCGGATGTGAACATCGGTACTGGAGTTGCCCACC 

CGCCACGGTGGACCTCCGACAGCACTGTGGCCGAGGTGACCAGCATTCAGCTGGAGTTCCGG 

GAGCTCTCCCGTCTCACAGGGGATAAGAAGTTTCAGGAGGCAGTGGAGAAGGTGACACAGCA 

CATCCACGGCCTGTCTGGGAAGAAGGATGGGCTGGTGCCCATGTTCATCAATACCCACAGTG 

GCCTCTTCACCCACCTGGGCGTATTCACGCTGGGCGCCAGGGCCGACAGCTACTATGAGTAC 

CTGCTGAAGCAGTGGATCCAGGGCGGGAAGCAGGAGACACAGCTGCTGGAAGACTACGTGGA 

AGCCATCGAGGGTGTCAGAACGCACCTGCTGCGGCACTCCGAGCCCAGTAAGCTCACCTTTG 

TGGGGGAGCTTGCCCACGGCCGCTTCAGTGCCAAGATGGACCACCTGGTGTGCTTCCTGCCA 

GGGACGCTGGCTCTGGGCGTCTACCACGGCCTGCCCGCCAGCCACATGGAGCTGGCCCAGGA 

GCTCATGGAGACTTGT TACC AGATGAACCGGCAGATGGAGACGGGGCTGAGT CCCGAGAT CG 

TGCACTTCAACCTTTACCCCCAGCCGGGCCGTCGGGACGTGGAGGTCAAGCCAGCAGACAGG 

CACAACCTGCTGCGGCCAGAGACCGTGGAGAGCCTGTTCTACCTGTACCGCGTCACAGGGGA 

CCGCAAATAC CAGGACTGGGGCTGGGAGATTCTGCAGAGCTTCAG C CGATTCACACGGGTCC 

CCTCGGGTGGCTATTCTTCCATCAACAATGTCCAGGATCCTCAGAAGCCCGAGCCTAGGGAC 

AAGATGGAGAGCTTCTTCCTGGGGGAGACGCTCAAGTATCTGTTCTTGCTCTTCTCCGATGA 

CCCAAACCTGCTCAGCCTGGACGCCTACGTGTTCAACACCGAAGCCCACCCTCTGCCTATCT 

GGACCCCTGCCTAGGGTGGATGGCTGCTGGTGTGGGGACTTCGGGTGGGCAGAGGCACCTTG 

CTGGGTCTGTGGCATTTTCCAAGGGCCCACGTAGCACCGGCAACCGCCAAGTGGCCCAGGCT 

CTGAACTGGCTCTGGGCTCCTCCTCGTCTCTGCTTTAATCAGGACACCGTGAGGACAAGTGA 

GGCCGTCAGTCTTGGTGTGATGCGGGGTGGGCTGGGCCGCTGGAGCCTCCGCCTGCTTCCTC 

CAGAAGACACGAATCATGACTCACGATTGCTGAAGCCTGAGCAGGTCTCTGTGGGCCGACCA 

GAGGGGGGCTTCGAGGTGGTCCCTGGTACTGGGGTGACCGAGTGGACAGCCCAGGGTGCAGC 

TCTGCCCGGGCTCGTGAAGCCTCAGATGTCCCCAATCCAAGGGTCTGGAGGGGCTGCCGTGA 

CTCCAGAGGCCTGAGGCTCCAGGGCTGGCTCTGGTGTTTACAAGCTGGACTCAGGGATCCTC 

CTGGCCGCCCCGCAGGGGGCTTGGAGGGCTGGACGGCAAGTCCGTCTAGCTCACGGGCCCCT 

CCAGTGGAATGGGTCTTTTCGGTGGAGATAAAAGTTGATTTGCTCTAACCGCAA 
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FTGURE 10 

></usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA56529 
xsubunit 1 of 1, 699 aa, 1 stop 
><MW: 79553, pi: 7.83, NX{S/T): 0 

MAACEGRRSGALGS SQSDFLTP PVGGAPWAVATTWMYPP PP P P PHRD F I S VTLS FGESYDN 
SKSWRRRSCWRKWKQLSRLQRNMILFLLAFLLFCGLLFYINLADHWKALAFRLEEEQKMRPE 
IAGLKPANPPVLPAPQKADTDPENLPEISSQKTQRHIQRGPPHLQIRPPSQDLKDGTQEEAT 
KRQEAPVDPRPEGDPQRTVISWRGAVIEPEQGTELPSRRAEVPTKPPLPPARTQGTPVHLNY 
RQKGVIDVFLHAWKGYRKFAWGHDELKPVSRSFSEWFGLGLTLIDALDTMWILGLRKEFEEA 
RKWVSKKLHFEKDVDVNLFESTIRILGGLLSAYHLSGDSLFLRKAEDFGNRLMPAFRTPSKI 
PYSDVNIGTGVAHPPRWTSDSTVAEVTSIQLEFRELSRLTGDKKFQEAVEKVTQHIHGLSGK 
KDGLVPMFINTHSGLFTHLGVFTLGARADSYYEYLLKQWIQGGKQETQLLEDYVEAIEGVRT 
HLLRHSEPSKLTFVGELAHGRFSAKMDHLVCFLPGTLALGVYHGLPASHMELAQELMETCYQ 
MNRQMETGLSPEIVHFNLYPQPGRRDVEVKPADRHNLLRPETVESLFYLYRVTGDRKYQDWG 
WEILQSFSRFTRVPSGGYSSINNVQDPQKPEPRDKMESFFLGETLKYLFLLFSDDPNLLSLD 

AYVFNTEAHPLP I WTPA 

Important features of the protein: 
Transmembrane domain: 

amino acids 21-40 and 84-105 (type II) 



WO 00/12708 



PCT/US99/20111 



FIGURE 11 



GGCGCCGCGTAGGCCCGGGAGGCCGGGCCGGCCGGGCTGCGAGCGCCTGCCCCATGCGCCGC 

CGCCTCTCCGCACGATGTTCCCCTCGCGGAGGAAAGCGGCGCAGCTGCCCTGGGAGGACGGC 

AGGTCCGGGTTGCTCTCCGGCGGCCTCCCTCGGAAGTGTTCCGTCTTCCACCTGTTCGTGGC 

CTGCCTCTCGCTGGGCTTCTTCTCCCTACTCTGGCTGCAGCTCAGCTGCTCTGGGGACGTGG 

CCCGGGCAGTCAGGGGACAAGGGCAGGAGACCTCGGGCCCTCCCCGTGCCTGCCCCCCAGAG 

CCGCCCCCTGAGCACTGGGAAGAAGACGCATCCTGGGGCCCCCACCGCCTGGCAGTGCTGGT 

GCCCTTCCGCGAACGCTTCGAGGAGCTCCTGGTCTTCGTGCCCCACATGCGCCGCTTCCTGA 

GCAGGAAGAAGATCCGGCACCACATCTACGTGCTCAACCAGGTGGACCACTTCAGGTTCAAC 

CGGGCAGCGCTCATCAACGTGGGCTTCCTGGAGAGCAGCAACAGCACGGACTACATTGCCAT 

GCACGACGTTGACCTGCTCCCTCTCAACGAGGAGCTGGACTATGGCTTTCCTGAGGCTGGGC 

CCTTCCACGTGGCCTCCCCGGAGCTCCACCCTCTCTACCACTACAAGACCTATGTCGGCGGC 

ATCCTGCTGCTCTCCAAGCAGCACTACCGGCTGTGCAATGGGATGTCCAACCGCTTCTGGGG 

CTGGGGCCGCGAGGACGACGAGTTCTACCGGCGCATTAAGGGAGCTGGGCTCCAGCTTTTCC 

GCCCCTCGGGAATCACAACTGGGTACAAGACATTTCGCCACCTGCATGACCCAGCCTGGCGG 

AAGAGGGACCAGAAGCGCATCGCAGCTCAAAAACAGGAGCAGTTCAAGGTGGACAGGGAGGG 

AGGCCTGAACACTGTGAAGTACCATGTGGCTTCCCGCACTGCCCTGTCTGTGGGCGGGGCCC 

CCTGCACTGTCCTCAACATCATGTTGGACTGTGACAAGACCGCCACACCCTGGTGCACATTC 

AGCTGAGCTGGATGGACAGTGAGGAAGCCTGTACCTACAGGCCATATTGCTCAGGCTCAGGA 

CAAGGCCTCAGGTCGTGGGCCCAGCTCTGACAGGATGTGGAGTGGCCAGGACCAAGACAGCA 

AGCTACGCAATTGCAGCCACCCGGCCGCCAAGGCAGGCTTGGGCTGGGCCAGGACACGTGGG 

GTGCCTGGGACGCTGCTTGCCATGCACAGTGATCAGAGAGAGGCTGGGGTGTGTCCTGTCCG 

GGACCCCCCCTGCCTTCCTGCTCACCCTACTCTGACCTCCTTCACGTGCCCAGGCCTGTGGG 

TAGTGGGGAGGGCTGAACAGGACAACCTCTCATCACCCTACTCTGACCTCCTTCACGTGCCC 

AGGCCTGTGGGTAGTGGGGAGGGCTGAACAGGACAACCTCTCATCACCCCCAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 12 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56531 
xsubunit 1 of 1, 327 aa, 1 stop 
><MW: 37406, pi: 9.30, NX(S/T): 1 

MFPSRRKAAQLPWEDGRSGLLSGGLPRKCSVFHLFVACLSLGFFSLLWLQLSCSGDVARAVR 
GQGQETSGPPRACPPEPPPEHWEEDASWGPHRLAVLVPFRERFEELLVFVPHMRRFLSRKKI 
RHHIYVLNQVDHFRFNRAALINVGFLESSNSTDYIAMHDVDLLPLNEELDYGFPEAGPFHVA 
SPELHPLYHYKTYVGGILLLSKQHYRLCNGMSNRFWGWGREDDEFYRRIKGAGLQLFRPSGI 
TTGYKTFRHLHDPAWRKRDQKRIAAQKQEQFKVDREGGLNTVKYKVASRTALSVGGAPCTVL 
NIMLDCDKTATPWCTFS 

Signal peptide: 

amino acids 1-42 

Transmembrane domain; 

amino acids 29-49 (type II) 

N-glycosylation site. 

amino acids 154-158 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 27-31 

Tyrosine kinase phosphorylation site. 

amino acids 226-233 

N-myristoylation site. 

amino acids 19-25, 65-71, 247-253, 285-291, 303-309, 304-310 
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FIGURE 13 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCT 
GCTGCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCT 
CTTCTGCCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTC 
CTGCCTCATCGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGT 
GGGGGATGGCTAAGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGG 
GCTG CCTTATTT AAAGTGGTTGTTTATGATTCTTATACTAATTTATACAAAGATATTAAGG C 
CCTGTTCATTAAGAAATTGTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGT 
AAATATGTCTTTATAATAAACAGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 



WO 00/12708 



PO7US99/20I11 



FIGURE 14 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56862 
<subunit 1 of 1, 73 aa, 1 stop 
<MW: 7879, pi: 7.21, NX(S/T): 0 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQ 
PRGEGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE 15 

GGGACCCATGCGGCCGTGACCCCCGGCTCCCTAGAGGCCCAGCGCAGCCGCAGCGGACAAAG 
GAGCATGTCCGCGCCGGGGAAGGCCCGTCCTCCGGCCGCCATAAGGCTCCGGTCGCCGCTGG 
GCCCGCGCCGCGCTCCTGCCCGCCCGGGCTCCGGGGCGGCCCGCTAGGCCAGTGCGCCGCCG 
CTCGCCCCGCAGGCCCCGGCCCGCAGCATGGAGCCACCCGGACGCCGGCGGGGCCGCGCGCA 
GCCGCCGCTGTTGCTGCCGCTCTCGCTGTTAGCGCTGCTCGCGCTGCTGGGAGGCGGCGGCG 
GCGGCGGCGCCGCGGCGCTGCCCGCCGGCTGCAAGCACGATGGGCGGCCCCGAGGGGCTGGC 
AGGGCGGCGGGCGCCGCCGAGGGCAAGGTGGTGTGCAGCAGCCTGGAACTCGCGCAGGTCCT 
GCCCCCAGATACTCTGCCCAACCGCACGGTCACCCTGATTCTGAGTAACAATAAGATATCCG 
AGCTGAAGAATGGCTCATTTTCTGGGTTAAGTCTCCTTGAAAGATTGGACCTCCGAAACAAT 
CTTATTAGTAGTATAGATCCAGGTGCCTTCTGGGGACTGTCATCTCTAAAAAGATTGGATCT 
GACAAACAATCGAATAGGATGTCTGAATGCAGACATATTTCGAGGACTCACCAATCTGGTTC 
GGCTAAACCTTTCGGGGAATTTGTTTTCTTCATTATCTCAAGGAACTTTTGATTATCTTGCG 
TCATTACGGTCTTTGGAATTCCAGACTGAGTATCTTTTGTGTGACTGTAACATACTGTGGAT 
GCATCGCTGGGTAAAGGAGAAGAACATCACGGTACGGGATACCAGGTGTGTTTATCCTAAGT 
CACTGCAGGCCCAACCAGTCACAGGCGTGAAGCAGGAGCTGTTGACATGCGACCCTCCGCTT 
GAATTGCCGTCTTTCTACATGACTCCATCTCATCGCCAAGTTGTGTTTGAAGGAGACAGCCT 
TC CTTT CCAGTG C ATGGCTTCATATATTGATCAGGACATGCAAGTGTTGTGGTATCAGGATG 
GGAGAATAGTTGAAAC CGATGAATCGCAAGGTAT TTTTGTTGAAAAGAACATGATTCACAAC 
TGCTCCTTGATTGCAAGTGCCCTAACCATTTCTAATATTCAGGCTGGATCTACTGGAAATTG 
GGGCTGTCATGTCCAGACCAAACGTGGGAATAATACGAGGACTGTGGATATTGTGGTATTAG 
AGAGTTCTGCAC AGTACTGT CCTC CAGAGAGGGTGGTAAACAACAAAGGTGACTTCAGATGG 
CCCAGAACATTGGCAGGCATTACTGCATATCTGCAGTGTACGCGGAACACCCATGGCAGTGG 
GATATATCCCGGAAACCCACAGGATGAGAGAAAAGCTTGGCGCAGATGTGATAGAGGTGGCT 
TTTGGGCAGATGATGATTATTCTCGCTGTCAGTATGCAAATGATGTCACTAGAGTTCTTTAT 
ATGTTTAATCAGATGCCCCTCAATCTTACCAATGCCGTGGCAACAGCTCGACAGTTACTGGC 
TTACACTGTGGAAGCAGCCAACTTTTCTGACAAAATGGATGTTATATTTGTGGCAGAAATGA 
TTGAAAAATTTGGAAGATTTACCAAGGAGGAAAAATCAAAAGAGCTAGGTGACGTGATGGTT 
GACATTGCAAGTAACATCATGTTGGCTGATGAACGTGTCCTGTGGCTGGCGCAGAGGGAAGC 
TAAAGCCTGCAGTAGGATTGTGCAGTGTCTTCAGCGCATTGCTACCTACCGGCTAGCCGGTG 
GAGCTCACGTTTATTCAACATATTCACCCAATATTGCTCTGGAAGCTTATGTCATCAAGTCT 
ACTGGCTTCACGGGGATGACCTGTACCGTGTTCCAGAAAGTGGCAGCCTCTGATCGTACAGG 
ACTTTCGGATTATGGGAGGCGGGATCCAGAGGGAAACCTGGATAAGCAGCTGAGCTTTAAGT 
GCAATGTTTCAAATACATTTTCGAGTCTGGCACTAAAGGTATGTTACATTCTGCAATCATTT 
AAGACTATTTACAGTTAAATTAGAATGCTCCAAATGTTCTGCTTCGCAAAATAACCTTATTA 
AAAGATTTTTTTTTGCAGGAAGATAGGTATTATTGCTTTTGCTACTGTTTTAAAGAAAACTA 
ACCAGGAAGAACTGCATTACGACTTTCAAGGGCCCTAGGCATTTTTGCCTTTGATTCCCTTT 
CTTCACATAAAAATATCAGAAATTACATTTTATAACTGCAGTGGTATAAATGCAAATATACT 
ATTGTTACATGTGAAAAAATTTTATTTGACTTAAAAGTTTATTTATTTGTTTTTTTGCTCCT 
GATTTTAAGACAATAAGATGTTTTCATGGGCCCCTAAAAGTATCATGAGCCTTTGGCACTGC 
GCCTGCCAAGCCTAGTGGAGAAGTCAACCCTGAGACCAGGTGTTTAATCAAGCAAGCTGTAT 
ATCAAAATTTTTGGCAGAAAACACAAATATGTCATATATCTTTTTTTAAAAAAAGTATTTCA 
TTGAAGCAAGCAA^TGAAAGCATTTTTACTGATTTTTAAAATTGGTGCTTTAGATATATTT 
GACTACACTGTATTGAAGCAAATAGAGGAGGCACAACTCCAGCACCCTAATGGAACCACATT 
TTTTTCACTTAGCTTTCTGTGGGCATGTGTAATTGTATTCTCTGCGGTTTTTAATCTCACAG 
TACTTTATTTCTGTCTTGTCCCTCAATAATATCACAAACAATATTCCAGTCATTTTAATGGC 
TGCATAATAACTGATCCAACAGGTGTTAGGTGTTCTGGTTTAGTGTGAGCACTCAATAAATA 
TTGAATGAATGAACGAAAAAAAAAAAAAAAA 

/5 
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FIGURE 16 

MEPPGRRRGRAQPPLLLPLSLUUjLALLGGGGGGGAAALPAGCKHDGRPRGAGRAAGAAEGK 
WCSSLELAQVLPPDTLPNRT VTL I LSNNK I S ELKNGS FSGLSLLERLDLRNNL I S S I DPGA 
FWGLSSLKRLDLTNNRIGCLNADIFRGLTNLVRLNLSGNLFSSLSQGTFDYLASLRSLEFQT 
EYLLCDCNILWMHRWVKEKNITVRDTRCVYPKSLQAQPVTGVKQELLTCDPPLELPSFYMTP 
SHRQWFEGDSLPFQCMASYIDQDMQVLWYQDGRIVETDESQGIFVEKNMIHNCSLIASALT 
ISNIQAGSTGNWGCHVQTKRGNNTRTVDIWLESSAQYCPPERWNNKGDFRWPRTIoAGITA 
YLQCTRNTHGSGIYPGNPQDERKAWRRCDRGGFWADDDYSRCQYANDVTRVLYMFNQMPLNL 
TNAVATARQLLAYTVE AANFSDKMDVI FVAEMIEKFGRFTKEEKSKELGDVMVD I ASNIMLA 
DERVLWLAQREAKACSR I VQCLQRI ATYRLAGGAHVYSTYSPN I ALEAYVI KSTGFTGMTCT 
VFQKVAASDRTGLSDYGRRDPEGNLDKQLSFKCNVSNTFSSLALKVCYILQSFKTIYS 

Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 13-40 (type II) 

N-glycosylation site. 

amino acids 81-85, 98-102, 159-163, 206-210, 301-305, 332-336, 
433-437, 453-457, 592-596 

N-myristoylation site. 

amino acids 29-35, 30-36, 31-37, 32-38, 33-39, 34-40, 51-57, 
57-63, 99-105, 123-129, 142-148, 162-168, 317-323, 320-326, 
384-390, 403-409, 554-560 
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FIGURE 17 

GCGTGGGGATGTCTAGGAGCTCGAAGGTGGTGCTGGGCCTCTCGGTGCTGCTGACGGCGGCC 
ACAGTGGCCGGCGTACATGTGAAGCAGCAGTGGGACCAGCAGAGGCTTCGTGACGGAGTTAT 
CAGAGACATTGAGAGGCAAATTCGGAAAAAAGAAAACATTCGTCTTTTGGGAGAACAGATTA 
TTTTGACTGAGCAACTTGT^AGCAGAAAGAGAGAAGATGTTATTGGCAAAAGGATCTCAAAAA 
TCATGACTTGAATGTGAAATATCTGTTGGACAGACAACACGAGTTTGTGTGTGTGTGTTGAT 
GGAGAGTAGCTTAGTAGTATCTTCATCTTTTTTTTTGGTCACTGTCCTTTTAAACTTGATCA 
AATAAAGGACAGTGGGTCATATAAGTTACTGCTTTCAGGGTCCCTTATATCTGAATAAAGGA 
GTGTGGGCAGACACTTTTTGGAAGAGTCTGTCTGGGTGATCCTGGTAGAAGCCCCATTAGGG 
TCACTGTCCAGTGCTTAGGGTTGTTACTGAGAAGCACTGCCGAGCTTGTGAGAAGGAAGGGA 
TGGATAGTAGCATCCACCTGAGTAGTCTGATCAGTCGGCATGATGACGAAGCCACGAGAACA 
TCGACCTCAGAAGGACTGGAGGAAGGTGAAGTGGAGGGAGAGACGCTCCTGATCGTCGAATCC 
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FIGURE 18 

MS RS SKWLGLS VLLTAATVAGVHVKQQWDQQRLRDG VI RD I ERQI RKKENI RLLGEQ 1 1 LT 
EQLEAEREKMLLAKGSQKS 

Signal peptide: 

amino acids 1-21 
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FIGURE 19 

CTGTCGTCTTTGCTTCAGCCGCAGTCGCCACTGGCTGCCTGAGGTGCTCTTACAGCCTGTTC 
CAAGTGTGGCTTAATCCGTCTCCACCACCAGATCTTTCTCCGTGGATTCCTCTGCTAAGACC 
GCTGCCATgCCAGTGACGGTAACCCGCACCACCATCACAACCACCACGACGTCATCTTCGGG 
CCTGGGGTCCCCCATGATCGTGGGGTCCCCTCGGGCCCTGACACAGCCCCTGGGTCTCCTTCGC 
CTGCTGCAGCTGGTGTCTACCTGCGTGGCCTTCTCGCTGGTGGCTAGCGTGGGCGCCTGGAC 
GGGGTCCATGGGCAACTGGTCCATGTTCACCTGGTGCTTCTGCTTCTCCGTGACCCTGATCA 
TCCTCATCGTGGAGCTGTGCGGGCTCCAGGCCCGCTTCCCCCTGTCTTGGCGCAACTTCCCC 
ATCACCTTCGCCTGCTATGCGGCCCTCTTCTGCCTCTCGGCCTCCATCATCTACCCCACCAC 
CTATGTCCAGTTCCTGTCCCACGGCCGTTCGCGGGACCACGCCATCGCCGCCACCTTCTTCT 
CCTGCATCGCGTGTGTGGCTTACGCCACCGAAGTGGCCTGGACCCGGGCCCGGCCCGGCGAG 
ATCACTGGCTATATGGCCACCGTACCCGGGCTGCTGAAGGTGCTGGAGACCTTCGTTGCCTG 
CATCATCTTCGCGTTCATCAGCGACCCCAACCTGTACCAGCACCAGCCGGCCCTGGAGTGGT 
GCGTGGCGGTGTACGCCATCTGCTTCATCCTAGCGGCCATCGCCATCCTGCTGAACCTGGGG 
GAGTGCACCAACGTGCTACCCATCCCCTTCCCCAGCTTCCTGTCGGGGCTGGCCTTGCTGTC 
TGTCCTCCTCTATGCCACCGCCCTTGTTCTCTGGCCCCTCTACCAGTTCGATGAGAAGTATG 
GCGGCCAGCCTCGGCGCTCGAGAGATGTAAGCTGCAGCCGCAGCCATGCCTACTACGTGTGT 
GCCTGGGACCGCCGACTGGCTGTGGCCATCCTGACGGCCATCAACCTACTGGCGTATGTGGC 
TGACCTGGTGCACTCTGCCCACCTGGTTTTTGTCAAGGTCTAAGACTCTCCCAAGAGGCTCC 
CGTTCCCTCTCCAACCTCTTTGTTCTTCTTGCCCGAGTTTTCTTTATGGAGTACTTCTTTCC 
TCCGCCTTTCCTCTGTTTTCCTCTTCCTGTCTCCCCTCCCTCCCACCTTTTTCTTTCCTTCC 
CAATTCCTTGCACTCTAACCAGTTCTTGGATGCATCTTCTTCCTTCCCTTTCCTCTTGCTGT 
TTCCTTCCTGTGTTGTTTTGTTGCCCACATCCTGTTTTCACCCCTGAGCTGTTTCTCTTTTT 
CTTTTCTTTCTTTTTTTTTTTTTTTTTTAAGACGGATTCTCACTCTGTGGCCCAGGCTGGAG 
TGCAGTGGTGCGATCTCAGCTCACTGCAACCCCCGCCTCCTGGGTTCAAGCGATTCTCCTCC 
CCCAGCCTCCCAAGTAGCTGGGAGGACAGGTGTGAGCTGCCGCACCCAGCCTGTTTCTCTTT 
TTCCACTCTTCTTTTTTCTCATCTCTTTTCTGGGTTGCCTGTCGGCTTTCTTATCTGCCTGT 
TTTGCAAGCACCTTCTCCTGTGTCCTTGGGAGCCCTGAGACTTCTTTCTCTCCTTGCCTCCA 
CCCACCTCCAAAGGTGCTGAGCTCACATCCACACCCCTTGCAGCCGTCCATGCCACAGCCCC 
CCAAGGGGCCCCATTGCCAAAGCATGCCTGCCCACCCTCGCTGTGCCTTAGTCAGTGTGTAC 
GTGTGTGTGTGTGTGTGTTTGGGGGGTGGGGGGTGGGTAGCTGGGGATTGGGCCCTCTTTCT 
CCCAGTGGAGGAAGGTGTGCAGTGTACTTCCCCTTTAAATTAAAAAACATATATATATATAT 
ATTTGGAGGTCAGTAATTTCCAATGGGCGGGAGGCATTAAGCACCGACCCTGGGTCCCTAGG 
CCCCGCCTGGCACTCAGCCTTGCCAGAGATTGGCTCCAGAATTTTTGCCAGGCTTACAGAACAC 
CCACTGCCTAGAGGCCATCTTAAAGGAAGCAGGGGCTGGATGCCTTTCATCCCAACTATTCT 
CTGTGGTATGAAAAAG 
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FIGURE 20 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA58 727 
<subunit 1 of 1, 322 aa, 1 stop 
<MW: 35274, pi: 8.57, NX(S/T): 1 

MPVTVTRTTITTTTTSSSGLGSPMIVGSPRALTQPLGLLRLLQLVSTCVAFSLVASVGAWTG 
SMGNWSMFTWCFCFSVTLIILIVELCGLQARFPLSWRNFPITFACYAALFCLSASIIYPTTY 
VQFLSHGRSRDHAI AATFFSC I ACVAYATEVAWTRARPGE ITGYMATVPGLLKVLETFVAC I 
IFAFISDPNLYQHQPALEWCVAVYAICFILAAIAILLNLGECTNVLPIPFPSFLSGLALLSV 
LLYATALVLWPLYQFDEKYGGQPRRSRDVSCSRSHAYYVCAWDRRLAVAILTAINLLAYVAD 
LVHSAHLVFVKV 

Important features: 
Transmembrane domains: 

amino acids 41-60 (type II), 66-85, 101-120, 137-153, 171-192, 
205-226, 235-255 and 294-312 

N-glycosylation site. 

amino acids 66-69 

Glycosaminoglycan attachment site. 

amino acids 18-21 
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FIGURE 21 

GAACGTGCCACCATGCCCAGCTAATTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTC 
TTGAACTCGTGACCTCATGATCCGCTCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACTGACGC 
CTGGCCAGCCTATGCATTTTTAAGAAATTATTCTGTATTAGGTGCTGTGCTAAACATTGGGCACTACAGTGACCA 
AAACAGACTGAATTCCCCAAGAGCCAAAGACCAGTGAGGGAGACCAACAAGAAACAGGAAATGCAAAAGAGACCA 
TTATTACTCACTATGACTAAGGGTCACAAATGGGGTACGTTGATGGAGAGTGATTTGTTAAGAGACTACAGAGGG 
AGGACAGACTACCAAGAGGGGGGCCAGGAAAGCTCCTCTGACGAGGTGGTATTTCAGCCCAAACTGGAAGAATGA 
GAAAGAGCTAGCCAGCCATCAGAATAGTCGAGAAGAGATGGGGAGCACTA(^CTC^CTACACTTTGGCCTGAGAA 
AATAGCATGGGATTGGAGGAGGCTGGGGGAACACCACTTCTGCCGACCTGGGCAGGAGGCATTGAGGGCTTGAGA 
AAGGGCAATGGCAGTAGCAGTAGAAAGG ACAGGGTAGG AG CAGGGACTT TGCAGGTG G AATCATTAGGTCTTATC 
AACAGATATGGGCAAGCAAAGCCAGGGGAGAATTGATGGTAATGCTGAGGTTTGGAGCCAGGCTAGATGGGACAG 
TGGTGGGTGATGCAAAGGAAAGAGGTCAGGAAGCAGGGCCAGACGTGGGGAGAAGGTGTGGGGGTTTGGTTTCCA 
TCTTGCCGAGTCTGCCGGAATGTGGATGGGAAGACCAAGAGGAGGAGCAAGGGGCAGAGGGGAAGGGAATCTTAA 
AGAAGTCCTGGATGCCACACTCTTCTTCCTTCCTCCTCTTCCCTCTCCTCAGAGGTCTCACTCGTGGTTCTTCAT 
TTCCTGCCCTGCCTCCATCTCCTCTGGGTGCTGGGAAAGTGGAGGATTAGCTGAAGTTTTGCTTCTCGGGGCCTG 
TCTGAATCTCCATTGCTTTCTGGGAGGACATAATTCACCTGTCCTAGCTTCTTATCATCTTACATTTCCCTGTAG 
CCACTGGGACATATGTGGTGTTCCTTCCTAGCTCCTGTCTCCTCCTCATGCCTTTGCTGGGTATGGGCATGTTAG 
GGGGAAGGTCATTGCTGTCAGAGGGGCACTGACTTTCTAATGGTGTTACCCAAGGTGAATGTTGGAGACACAGTC 
GCGATGCTGCCCAAGTCCCGGCGAGCCCTAACTATCCAGGAGATCGCTGCGCTGGCCAGGTCCTCCCTGCATGGT 
ATGCAGCCCCTCCCATGTTTCTGGCCACTTTGTCCTTTCTCCTCCCGTTTGCACATCCCTTTGGAACTGTTTCCT 
GTGAGTACATGCTGGGGTCTCCCCTTTCTTCCCTTGCTCAGGTGAATCTCAGCCCCTTCTCCCACCCAAAGGTTC 
ACATGGATCCTAACTACTGCCACCCTTCCACCTCCCTGCACCTGTGCTCCCTGGCCTGGTCCTTTACCAGGCTTC 
TCCACCCTCCCCTATCTCCAGGTATTTCCCAGGTGGTGAAGGACCACGTGACCAAGCCTACCGCCATGGCCCAGG 
GCCGAGTGGCTCACCTCATTGAGTGGAAGGGCTGGAGCAAGCCGAGTGACTCACCTGCTGCCCTGGAATCAGCCT 
TTTCCTCCTATTCAGACCTCAGCGAGGGCGAACAAGAGGCTCGCTTTGCAGCAGGAGTGGCTGAGCAGTTTGCCA 
TCGCGGAAGCCAAGCTCCGAGCATGGTCTTCGGTGGATGGCGAGGACTCCACTGATGACTCCTATGATGAGGACT 
TTGCTGGGGGAATGGACACAGACATGGCTGGGCAGCTGCCCCTGGGGCCGCACCTCCAGGACCTGTTCACCGGCC 
ACCGGTTCTCCCGGCCTGTGCGCCAGGGCTCCGTGGAGCCTGAGAGCGACTGCTCACAGACCGTGTCCCCAGACA 
CCCTGTGCTCTAGTCTGTGCAGCCTGGAGGATGGGTTGTTGGGCTCCCCGGCCCGGCTGGCCTCCCAGCTGCTGG 
GCGATGAGCTGCTTCTCGCCAAACTGCCCCCCAGCCGGGAAAGTGCCTTCCGCAGCCTGGGCCCACTGGAGGCCC 
AGGACTCACTCTACAACTCGCCCCTCACAGAGTCCTGCCTTTCCCCCGCGGAGGAGGAGCCAGCCCCCTGCAAGG 
ACTGCCAGCCACTCTGCCCACCACTAACGGGCAGCTGGGAACGGCAGCGGCAAGCCTCTGACCTGGCCTCTTCTG 
GGGTGGTGTCCTTAGATGAGGATGAGGCAGAGCCAGAGGAACAGTGACCCACATCATGCCTGGCAGTGGCATGCA 
TCCCCCGGCTGCTGCCAGGGGCAGAGCCTCTGTGCCCAAGTGTGGGCTCAAGGCTCCCAGCAGAGCTCCACAGCC 
TAGAGGGCTCCTGGGAGCGCTCGCTTCTCCGTTGTGTGTTTTGCATGAAAGTGTTTGGAGAGGAGGCAGGGGCTG 
GGCTGGGGGCGCATGTCCTGCCCCCACTCCCGGGGCTTGCCGGGGGTTGCCCGGGGCCTCTGGGGCATGGCTACA 
GCTGTGGCAGACAGTGATGTTCATGTTCTTAAAATGCC^CACACACATTTCCTCCTCGGATAATGTGAACCACTA 
AGGGGGTTGTGACTGGGCTGTGTG AGGGTGGGGTGGGAGGGGG CC CAGCAACCCCCC ACCCTCCCCATGCCT CTC 
TCTTCTCTGCTTTTCTTCTCACTTCCGAGTCCAT^ 

CTCCTGCCCTCCCGGAGCCTATGGGTTGAGCCGTCCCTCAAGGGCCCCTGCCCAGCrGGGCTCGTGCTGTGCTTC 
ATTCACCTCTCCATCGTCTCTAAATCTTCC 

GGATCTTCTCTTCn7CTGGGAGGCTTTGGAATGATGAAAGCATGTACCCTCCACCCTTTTCCTGGCCCCCTAATGG 
GGCCTGGGCCCTTTCCCAACCCCTCCTAGGATGTGCGGGCAGTGTGCTGGCGCCTCACAGCCAGCCGGGCTGCCC 
ATTCACGCAGAGCTCTCTGAGCGGGAGGTGGAAGAAAGGATGGCTCTGGTTGCCACAGAGCTGGGACTTCATGTT 
CTTCTAGAGAGGGCCACAAGAGGGCCACAGGGGTGGCCGGGAGTTGTCAGCTGATGCCTGCTGAGAGGCAGGAAT 
TGTGCCAGTGAGTGACAGTGATGAGGGAGTGTCTCTTCTTGGGGAGGAAAGAAGGTAGAGCCTTTCTGTCTGAAT 
GAAAGGCCAAGGCTACAGTACAGGGCCCCGCCCCAGCCAGGGTGTTAATGCCCACGTAGTGGAGGCCTCTGGCAG 
AT CCTG C ATTC CAAGGTCACTGGACTGT ACGTTTTTATGGTTGTGGGAAGGGTGGGTGGCTTTAG AATTAAGGGC 
CTTGTAGGCTTTGGCAGGTAAGAGGGCCCAAGGTAAGAACGAGAGCCAACGGGCACAAGCATTCTATATATAAGT 
GGCTCATTAGGTGTTTATTTTGTTCTATTTAAGAATTTGTTTTATTAAATTAATATAAAAATCTTTGTAAATCTC 
TAAAA 
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FTGIJRE 22 



MFLATLSFLLPFAHPFGTVSCEYMIiGSPLSSLAQVNLSPFSHPKVHMDPNYCHPSTSLHLCS 
LAWSFTRLLHPPLSPGISQWKDHVTKPTAMAQGRVAHLIEWKGWSKPSDSPAALESAFSSY 
SDLSEGEQEARFAAGVAEQFAIAEAKLRAWSSVDGEDSTDDSYDEDFAGGMDTDMAGQLPLG 
PHLQDLFTGHRFSRPVRQGSVEPESDCSQTVSPDTLCSSLCSLEDGLLGSPARLASQLLGDE 
LLLAKLPPSRESAFRSLGPLEAQDSLYNSPLTESCLSPAEEEPAPCKDCQPLCPPLTGSWER 
QRQASDLASSGWSLDEDEAEPEEQ 

Signal peptide: 

amino acids 1-15 

Casein kinase II phosphorylation site. 

amino acids 123-127, 128-132, 155-159, 162-166, 166-170, 228-232, 
285-289, 324-328 

Tyrosine kinase phosphorylation site. 

amino acids 44-52 

N-myristoylation site. 

amino acids 17-23, 26-32, 173-179 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 11-22 
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FIGURE 23 

GGTTCCTGGGCGCTCTGTTACACAAGCAAGATACAGCCAGCCCCACCTAATTTTGTTTCCCT 

GGCACCCTCCTGCTCAGTGCGACATTGTCACACTTAACCCATCTGTTTTCTCTAATGCACGA 

CAGATTCCTTTCAGACAGGACAACTGTGATATTTCAGTTCCTGATTGTAAATACCTCCTAAG 

CCTGAAGCTTCTGTTACTAGCCATTGTGAGCTTCAGTTTCTTCATCTGCAAAATGGGCATAA 

TACAATCTATTCTTGCCACATCAAGGGATTGTTATTCCTTTAAAAAAAAACCAATACCAAAG 

AAGCCTACAATGTTGGCCTTAGCCAAAATTCTGTTGATTTCAACGTTGTTTTATTCACTTCT 

ATCGGGGAGCCATGGAAAAGAAAATCAAGACATAAACACAACACAGAACATTGCAGAAGTTT 

TTAAAACAATGGAAAATAAACCTATTTGTTTGGAAAGTGAAGCAAACTTAAACTCAGATAAA 

GAAAATATAACCACCTCAAATCTCAAGGCGAGTCATTCCCCTCCTTTGAATCTACCCAACAA 

CAGCCACGGAATAACAGATTTCTCCAGTAACTCATCAGCAGAGCATTCTTTGGGCAGTCTAA 

AACCCACATCTACCATTTCCACAAGCCCTCCCTTGATCCATAGCTTTGTTTCTAAAGTGCCT 

* 

TGGAATGCACCTATAGCAGATGAAGATCTTTTGCCCATCTCAGCACATCCCAATGCTACACC 
TGCTCTGTCTTCAGAAAACTTCACTTGGTCTTTGGTCAATGACACCGTGAAAACTCCTGATA 
ACAGTTCCATTACAGTTAGCATCCTCTCTTCAGAACCAACTTCTCCATCTGTGACCCCCTTG 
ATAGTGGAACCAAGTGGATGGCTTACCACAAACAGTGATAGCTTCACTGGGTTTACCCCTTA 
TCAAGAAAAAACAACTCTACAGCCTACCTTAAAATTCACCAATAATTCAAAACTCTTTCCAA 
ATACGTCAGATCCCCAAAAAGAAAATAGAAATACAGGAATAGTATTCGGGGCCATTTTAGGT 
GCTATTCTGGGTGTCTCATTGCTTACTCTTGTGGGCTACTTGTTGTGTGGAAAAAGGAAAAC 
GGATTCATTTTCCCATCGGCGACTTTATGACGACAGAAATGAACCAGTTCTGCGATTAGACA 
ATGCACCGGAACCTTATGATGTGAGTTTTGGGAATTCTAGCTACTACAATCCAACTTTGAAT 
GATT CAGC CATG CC AGAAAGTGAAGAAAATGCACGTGATGGCATTCCTATGGATGACATACC 
TCCACTTCGTACTTCTGTATAGAACTAACAGCAAAAAGGCGTTAAACAGCAAGTGTCATCTA 
CATCCTAGCCTTTTGACAAATTCATCTTTCAAAAGGTTACACAAAATTACTGTCACGTGGAT 
TTTGTCAAGGAGAATCATAAAAGCAGGAGACCAGTAGCAGAAATGTAGACAGGATGTATCAT 
CCAAAGGTTTTCTTTCTTACAATTTTTGGCCATCCTGAGGCATTTACTAAGTAGCCTTAATT 
TGTATTTTAGTAGTATTTTCTTAGTAGAAAATATTTGTGGAATCAGATAAAACTAAAAGATT 
TCACCATTACAGCCCTGCCTCATAACTAAATAATAAAAATTATTCCACCAAAAAATTCTAAA 
ACAATGAAGATGACTCTTTACTGCTCTGCCTGAAGCCCTAGTACCATAATTCAAGATTGCAT 
TTTCTTAAATGAAAATTGAAAGGGTGCTTTTTAAAGAAAATTTGACTTAAAGCTAAAAAGAG 
GACATAGCCCAGAGTTTCTGTTATTGGGAAATTGAGGCAATAGAAATGACAGACCTGTATTC 
TAGTACGTTATAATTTTCTAGATCAGCACACACATGATCAGCCCACTGAGTTATGAAGCTGA 
CAATGACTGCATTCAACGGGGCCATGGCAGGAAAGCTGACCCTACCCAGGAAAGTAATAGCT 
TCTTTAAAAGTCTTCAAAGGTTTTGGGAATTTTAACTTGTCTTAATATATCTTAGGCTTCAA 
TTATTTGGGTGCCTTAAAAACT CAATGAGAATCATGGT 
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FIGURE 24 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA58732 
xsubunit 1 of 1, 334 aa, 1 stop 
><MW: 36294, pi: 4.98, NX(S/T): 13 

MLALAKILLISTLFYSLLSGSHGKENQDINTTQNIAEVFKTMENKPISLESEANLNSDKENI 
TTSNLKASHSPPLNLPNNSHGITDFSSNSSAEHSLGSLKPTSTISTSPPLIHSFVSKVPWNA 
PIADEDLLPISAHPNATPALSSENFTWSLVNDTVKTPDNSSITVSILSSEPTSPSVTPLIVE 
PSGWLTTNSDSFTGFTPYQEKTTLQPTLKFTNNSKLFPNTSDPQKENRNTGIVFGAILGAIL 
GVSLLTLVGYLLCGKRKTDSFSHRRLYDDRNEPVLRLDNAPEPYDVSFGNSSYYNPTLNDSA 
MPESEENARDGIPMDDIPPLRTSV 

Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 235-262 

N-glycosylation site. 

amino acids 30-34, 61-65, 79-83, 90-94, 148-152, 155-159, 
163-167, 218-222, 225-229, 298-302, 307-311 
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FTGURE 25 

AACAGGATCTCCTCTTGCAGTCTGCAGCCCAGGACGCTGATTCCAGCAGCGCCTTACCGCGC 
AG CC CGAAGATTCACTATGGTGAAAATCGC CTTC AATACCCCTACCGC CGTG CAAAAGGAGG 
AGGCGCGGCAAGACGTGGAGGCCCTCCTGAGCCGCACGGTCAGAACTCAGATACTGACCGGC 
AAGGAGCTCCGAGTTGCCACCCAGGAAAAAGAGGGCTCCTCTGGGAGATGTATGCTTACTCT 
CTTAGGCCTTTCATTCATCTTGGCAGGACTTATTGTTGGTGGAGCCTGCATTTACAAGTACT 
TCATGCCCAAGAGCACCATTTACCGTGGAGAGATGTGCTTTTTTGATTCTGAGGATCCTGCA 
AATTCCCTTCGTGGAGGAGAGCCTAACTTCCTGCCTGTGACTGAGGAGGCTGACATTCGTGA 
GGATGACAACATTGCAATCATTGATGTGCCTGTCCCCAGTTTCTCTGATAGTGACCCTGCAG 
CAATTATTCATGACTTTGAAAAGGGAATGACTGCTTACCTGGACTTGTTGCTGGGGAACTGC 
TATCTGATGCCCCTCAATACTTCTATTGTTATGCCTCCAAAAAATCTGGTAGAGCTCTTTGG 
CAAACTGGCGAGTGGCAGATATCTGCCTCAAACTTATGTGGTTCGAGAAGACCTAGTTGCTG 
TGGAGGAAATTCGTGATGTTAGTAACCTTGGCATCTTTATTTACCAACTTTGCAATAACAGA 
AAGTCCTTCCGCCTTCGTCGCAGAGACCTCTTGCTGGGTTTCAACAAACGTGCCATTGATAA 
ATGCTGGAAGATTAGACACTTCCCCAACGAATTTATTGTTGAGACCAAGATCTGTCAAGAGT 
AAGAGGCAACAGATAGAGTGTCCTTGGTAATAAGAAGTCAGAGATTTACAATATGACTTTAA 
CATTAAGGTTTATGGGATACTCAAGATATTTACTCATGCATTTACTCTATTGCTTATGCTTT 
AAAAAAAGGAAAAAAAAAAAAACTACTAACCACTGCAAGCTCTTGTCAAATTTTAGTTTAAT 
TGGCATTGCTTGTTTTTTGAAACTGAAATTACATGAGTTTCATTTTTTCTTTGCATTTATAG 
GGTTTAGATTTCTGAAAGCAGCATGAATATATCACCTAACATCCTGACAATAAATTCCATCC 
GTTGTTTTTTTTGTTTGTTTGTTTTTTCTTTTCCTTTAAGTAAGCTCTTTATTCATCTTATG 
GTGGAGCAATTTTAAAATTTGAAATATTTTAAATTGTTTTTGAACTTTTTGTGTAAAATATA 
TCAGATCTCAACATTGTTGGTTTCTTTTGTTTTTCATTTTGTACAACTTTCTTGAATTTAGA 
AATTACAT CTTTGCAGTTCTGTTAGGTG CTCTGTAATTAACCTGACTTATATGTGAACAATT 
TTCATGAGACAGTCATTTTTAACTAATGCAGTGATTCTTTCTCACTACTATCTGTATTGTGG 
AATGCACAAAATTGTGTAGGTGCTGAATGCTGTAAGGAGTTTAGGTTGTATGAATTCTACAA 
CCCTATAATAAATTTTACTCTATACAAAAAAAAAAAAAAAAAAA 
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FIGURE 26 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA58 828 
<subunit 1 of 1, 263 aa, 1 stop 
<MW: 29741, pi: 5.74, NX(S/T): 1 

MVKIAFNTPTAVQKEEARQDVEALLSRTVRTQILTGKELRVATQEKEGSSGRCMLTLLGLSF 
ILAGLIVGGACIYKYFMPKSTIYRGEMCFFDSEDPANSLRGGEPNFLPVTEEADIREDDNIA 
IIDVPVPSFSDSDPAAIIHDFEKGMTAYLDLLLGNCYLMPLNTSIVMPPKNLVELFGKLASG 
R YL PQT YWREDLVAVEE I RDVSNLG I F I YQLCNNRKS FRLRRRDLLLGFNKRA I DKCWK I R 
HFPNEFIVETKICQE 

Type II transmembrane domain: 

amino acids 53-75 

N-glycosylation site. 

amino acids 166-170 

Casein kinase II phosphorylation site. 

amino acids 35-39, 132-136, 134-138 

N-myristoylation site. 

amino acids 66-72, 103-109 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 63-74 
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FIGURE 27 

GGAGGAGGGAGGGCGGGCAGGCGCCAGCCCAGAGCAGCCCCGGGCACCAGCACGGACTCTCT 

CTTCCAGCCCAGGTGCCCCCCACTCTCGCTCCATTCGGCGGGAGCACCCAGTCCTGTACGCC 

AAGGAACTGGTCCTGGGGGCACCATGGTTTCGGCGGCAGCCCCCAGCCTCCTCATCCTTCTG 

TTGCTGCTGCTGGGGTCTGTGCCTGCTACCGACGCCCGCTCTGTGCCCCTGAAGGCCACGTT 

CCTGGAGGATGTGGCGGGTAGTGGGGAGGCCGAGGGCTCGTCGGCCTCCTCCCCGAGCCTCC 

CGCCACCCTGGACCCCGGCCCTCAGCCCCACATCGATGGGGCCCCAGCCCACAACCCTGGGG 

GGCCCATCACCCCCCACCAACTTCCTGGATGGGATAGTGGACTTCTTCCGCCAGTACGTGAT 

GCTGATTGCTGTGGTGGGCTCCCTGGCCTTTCTGCTGATGTTCATCGTCTGTGCCGCGGTCA 

TCACCCGGCAGAAGCAGAAGGCCTCGGCCTATTACCCATCGTCCTTCCCCAAGAAGAAGTAC 

GTGGACCAGAGTGACCGGGCCGGGGGCCCCCGGGCCTTCAGTGAGGTCCCCGACAGAGCCCC 

CGACAGCAGGCCCGAGGAAGCCCTGGATTCCTCCCGGCAGCTCCAGGCCGACATCTTGGCCG 

CCACCCAGAACCTCAAGTCCCCCACCAGGGCTGCACTGGGCGGTGGGGACGGAGCCAGGATG 

GTGGAGGGCAGGGGCGCAGAGGAAGAGGAGAAGGGCAGCCAGGAGGGGGACCAGGAAGTCCA 

GGGACATGGGGTCCCAGTGGAGACACCAGAGGCGCAGGAGGAGCCGTGCTCAGGGGTCCTTG 

AGGGGGCTGTGGTGGCCGGTGAGGGCCAAGGGGAGCTGGAAGGGTCTCTCTTGTTAGCCCAG 

GAAGCCCAGGGACCAGTGGGTCCCCCCGAAAGCCCCTGTGCTTGCAGCAGTGTCCACCCCAG 

TGTCTAACAGTCCTCCCGGGCTGCCAGCCCTGACTGTCGGGCCCCCAAGTGGTCACCTCCCC 

GTGTATGAAAAGGCCTTCAGCCCTGACTGCTTCCTGACACTCCCTCCTTGGCCTCCCTGTGG 

TGCCAATCCCAGCATGTGCTGATTCTACAGCAGGCAGAAATGCTGGTCCCCGGTGCCCCGGA 

GGAATCTTACCAAGTGCCATCATCCTTCACCTCAGCAGCCCCAAAGGGCTACATCCTACAGC 

ACAGCTCCCCTGACAAAGTGAGGGAGGGCACGTGTCCCTGTGACAGCCAGGATAAAACATCC 

CCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGTGCCCGGCCCAAACTACTTTTTAAAACA 

GCTACAGGGTAAAATCCTGCAGCACCCACTCTGGAAAATACTGCTCTTAATTTTCCTGAAGG 

TGGCCCCCTGTTTCTAGTTGGTCCAGGATTAGGGATGTGGGGTATAGGGCATTTAAATCCTC 

TCAAGCGCTCTCCAAGCACCCCCGGCCTGGGGGTGAGTTTCTCATCCCGCTACTGCTGCTGG 

GATCAGGTTGAATGAATGGAACTCTTCCTGTCTGGCCTCCAAAGCAGCCTAGAAGCTGAGGG 

GCTGTGTTTGAGGGGACCTCCACCCTGGGGAAGTCCGAGGGGCTGGGGAAGGGTTTCTGACG 

CCCAGCCTGGAGCAGGGGGGCCCTGGCCACCCCCTGTTGCTCACACATTGTCTGGCAGCCTG 

TGTCCACAATATTCGTCAGTCCTCGACAGGGAGCCTGGGCTCCGTCCTGCTTTAGGGAGGCT 

CTGGCAGGAGGTCCTCTCCCCCATCCCTCCATCTGGGGCTCCCCCAACCTCTGCACAGCTCT 

CCAGGTGCTGAGATATAATGCACCAGCACAATAAACCTTTATTCCGGCCTGAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGA 

/170 
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FIGURE 28 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA58852 
xsubunit l of 1, 283 aa, 1 stop 
><MW: 29191, pi: 4.52, NX(S/T): 0 

MVSAAAPSLLILLLLLLGSVPATDARSVPLKATFLEDVAGSGEAEGSSASSPSLPPPWTPAL 
SPTSMGPQPTTLGGPSPPTNFLDGIVDFFRQYVMLIAWGSLAFLLMFIVCAAVITRQKQKA 
SAYYPSSFPKKKYVDQSDRAGGPRAFSEVPDRAPDSRPEEALDSSRQLQADILAATQNLKSP 
TRAALGGGDGARMVEGRGAEEEEKGSQEGDQEVQGHGVPVETPEAQEEPCSGVLEGAWAGE 
GQGELEGSLLLAQEAQGPVGPPESPCACSSVHPSV 

Signal peptide: 

amino acids 1-2 5 

Transmembrane domain: 

amino acids 94-118 

N-myristoylation site. 

amino acids 18-24, 40-46, 46-52, 145-151, 192-198, 193-199, 
211-217, 238-244, 242-248 
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FIGURE 29 

GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGA 

GGGAGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCG 

GCAAGGAGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATG 

AAGTTCCAGGGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGG 

CCCCCTGCAGAGCGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCC 

TGGGAGACGCCCTGAGCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCT 

GGCTCTAAAGTCAGTGAGGCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAG 

GCAGGTTCCAGGCTTTGGCGCAGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATG 

CTCTGGGAAACACTGGGCACGAGATTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCA 

GATGCTGTCCGCGGCTCCTGGCAGGGGGTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGG 

AGGCCATGGCATCTTTGGCTCTCAAGGTGGCCTTGGAGGCCAGGGCCAGGGCAATCCTGGAG 

GTCTGGGGACTCCGTGGGTCCACGGATACCCCGGT^AACTCAGCAGGCAGCTTTGGAATGAAT 

CCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGAGGGCCACCAAACTTTGGGACCAACAC 

TCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAGAGCCAGCAACCAGAATGAAGGGT 

GCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCAACTCTGGGGGAGGCAGCGGC 

TCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAACAATGGCAGCAGCAGTGG 

TGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAGTGGCGGCAGCAGTG 

GTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGTCCTCCTGGGGA 

TCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGACATAAACC 

CGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGGCTTCA 

GAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTGGA 

GGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGT 

TGGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCT 

GGAAGAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGA 

AGCTCTCGCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACT 

CCCTCCTTAAAACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTA 

GCTGCCCCACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 30 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA59212 
xsubunit 1 of 1, 440 aa, 1 stop 
><MW: 42208, pi: 6.36, NX(S/T): 1 

MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGA 
AGSKVS EALGQGTREAVGTGVRQVPG FGAADALGNRVGEAAHALGNTGHE I GRQAEDV I RHG 
ADAVRGSWQGVPGHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGM 
NPQGAPWGQGGNGGPPNFGTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGS 
GSQSGSSGSGSNGDNNNGSSSGGSSSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSW 
GSSTGSSSGNHGGSGGGNGHKPGCEKPGNEARGSGESGIQGFRGQGVSSNMREISKEGNRLL 
GGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGMFNFDTFWKNFKSKLGFINWDAINKDQ 
RSSRIP 



Signal peptide: 

amino acids 1-21 



N-glycosylation site. 

amino acids 265-269 



Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 



Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 

74-80, 90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 

155-161, 159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 

218-224, 236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 

249-252, 253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 

279-285, 283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 

295-301, 298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 

323-329, 325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 
383-389, 387-393, 389-395, 395-401 



Cell attachm nt sequence. 

amino acids 301-304 
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FT CI IRE 31 

GACCGGTCCCTCCGGTCCTGGATGTGCGGACTCTGCTGCAGCGAGGGCTGCAGGCCCGCCGGGCGGTGCTCACCG 
TGCCCTGGCTGGTGGAGTTTCTCTCCTTTGCTGACCATGTTGTTCCCTTGCTGGAATATTACCGGGACATCTTCA 
CTCTCCTGCTGCGCCTGCACCGGAGCTTGGTGTTGTCGCAGGAGAGTGAGGGGAAGATGTGTTTCCTGAACAAGC 
TGCTGCTACTTGCTGTCCTGGGCTGGCTTTTCCAGATTCCCACAGTCCCTGAGGACTTGTTCTTTCTGGAAGAGG 
GTCC CT CATATGCCTTTGAGGTGGAC ACAGT AGCC CCAG AG C ATGGCTTGGACAATGCGCCTGTGGTGGACCAGC 
AGCTGCTCTACACCTGCTGCCCCTACATCGGAGAGCTCCGGAAACTGCTCGCTTCGTGGGTGTCAGGCAGTAGTG 
G ACG GAGTGGGGGCTTCATG AGG AAAATCACCCC CACCACT ACCACCAGCCTGGG AGCCC AGCCTTCCCAGACCA 
GCGAGGGGCTGCAGGC^CAGCTCGCCCAGGCCTTTTT 

TCGTGGCAGAAAGAATTGGATCAAACTGTGTCAAACATATCAAGGCTACACTGGTGGCAGATCTGGTGCGCCAGG 

CAGAGTCACTTCTCCAAGAGCAGCTGGTGACACAGGGAGAGGAAGGGGGAGACCCAGCCCAGCTGTTGGAGATCT 

TGTGTTCCCAGCTGTGCCCTCACGGGGCCCAGGCATTGGCCCTGGGGCGGGAGTTCTGTCAAAGGAAGAGCCCTG 

GGGCTGTGCGGGCGCTGCTTCCAGAGGAGACCCCGGCAGCCGTTCTGAGCAGTGCAGAGAACATTGCTGTGGGGC 

TTG C AACAGAGAAAGC CTGTGCTTGG CTGTCAGCCAACATC ACAGCACTGATCAGGAGGGAGGTGAAAGCAGCAG 

TGAGTCGCACACTTCGAGCCCAGGGTCCTGAACCTGCTGCCCGGGGGGAGCGGAGGGGCTGCTCCCGCGCCTGAC 

GTGCTCTCCTTGGCCGTGGGGCCACGGGACCCTGACGAGGGAGTCTCCCCAGAGCATCTGGAACAGCTCCTAGGC 

CAGCTGGGCCAGACGCTGCGGTGCCGCCAGTTCCTGTGCCCACCTGCTGAGCAGCATCTGGCAAAGTGCTCTGTG 

GAGTTAGCTTCCCTCCTCGTTGCAGATCAAATTCCTATCCTAGGGCCCCCGGCACAGTACAGGCTGGAGAGAGGG 

CAGGCTCGAAGGCTTCTGCACATGCTGCTTTCCTTGTGGAAGGAAGACTTTCAGGGGCCGGTTCCGCTGCAGCTG 

CTGCTG AGCCCAAGAAATGTGGG GC TTCTGG CAGAC ACAAGG CCAAGGG AGTGGG ACTTGCTGCTATl , Cl"i'GCTA 

CGGGAGCTGGTGGAGAAGGGTCTGATGGGACGGATGGAGATAGAGGCCTGCCTGGGCAGCCTCCACCAGGCCCAG 

TGGCCAGGGGACTTTGCTGAAGAATTAGCAACACTGTCTAATCTGTTTCTAGCCGAGCCCCACCTGCCAGAACCC 

CAGCTAAGAGCCTGTGAGTTGGTGCAGCCAAACCGGGGCACTGTGCTGGCCCAGAGCTAGGGCTGAGAAGTGGCC 

CTGCCTTGGGCATTGCACCAGAACCCTGGACCCCCGCCTCACGAGGAGGCCCAAGTGCCCAATGCAGACCCTCAC 

TGGTTGGGGTGTAGCTGGGTCTACAGTCAGACTTCCTGCTCTAAGGGTGTCACTGCCTGGCATCCCACCACGCGA 

ATCCTAGAGGAAGGAGAGTTGGCCTGATTTGGGATTATGGCAGAAAAGTCCAGAGATGCCAGTCCTGGAGTAGAA 

GAGGTGGTGTTTGTTTATCTCTTGGATACTAAATGAAATGAGGTGTGTGGGCTTGTCAACACAGAATTCAAGCCT 

C ATTTG CTAT CCCAG CATCTCTT AAAACTTTGTAGTCTTGG AATTCATG AC AGAGGC AAATG ACTCCTGCTTAAC 

TTATGAAGAAAGTTAAAACATGAATCTTGGGAGTCTACATTTTCTTATCACCAGGAGCTGGACTGCCATCTCCTT 

ATAAATGCCTAACACAGGCCGGGTCTGGTGGCTCATGCCTGTAATCCCAGCACTTTGAGAGGCCTGAGGTCGGCG 

GACTGCCTGAGGTCAGGAATTCAAGACCAGCCTGGCCAACATGGCAAAACCCC^TCTCTACTAAAAATAAAAAAA 

TTATTAGCTGGGCATGGTGGTGTGTGCCTGTAATCCCAGCTACTCAGGAGGATGAGGCAGGAGACCTGCTTGAAC 

CTGGAGGTGGAGGTTGCAGTGAGCCGAGGTCGCACCACTGCACTCCAGTCTGGGTAACAGAGCGAGACTTTCTAG 

AAAAAGC CTAAC AAAC AGATAAGGTAGG ACTCAAC CAACTG AAACCTGACTTTCCCCCTGTACCTTCAGCCCCTG 

TGCAGGTAGTAACCTCTTGAGACCTCTCCCTGACCAGGGACCAAGCACAGGGCATTTAGAGCTTTTTAGAATAAA 

CTGGTTTTCTTTAAAAAAAAAAAAAAAAAAAGGGCGGCCGCCCTTTTTTTTTTTT^ 

TTTTTTTTTTTTTTTTAAAAAGGGCTTTTAT^ 

TGGGGCGTGTCCTGTAGGGAAAGGCCCTGTTTTCCCTGAGGCGGGGCTGGGCTTGTCCATGGGTCCGCGGAGCTG 

GCCGTGCTTGGCGCCCTGGCGTGTGTCTAGCTGCTTCTTGCCGGGCACAGAGCTGCGGGGTCTGGGGGCACCGGG 

AGCTAAGAGCAGGCTCTGGTGCAGGGGTGGAGGCCTGTCTCTTAACCGACACCCTGAGGTGCTCCTGAGATGCTG 

GGTCC ACCCTGAGTGGCACGGGG AGCAGCTGTGGCCGGTG CT CCTTCYTAGGCCAGTCCTGGGGAAACTAAGCTC 

GGGCCCTTCTTTGCAAAGACCGAGGATGGGGTGGGTGTGGGGGACTCATGGGGAATGGCCTGAGGAG 

GAAGAGG^CGCCGGTTTGTTGGCTGCAGCGGCCTGGAGCGCCTCTCTCCTGAG 

ATGAAGAACATGCCGTCTCGGTGTCTCAGGGCTATTAGGACTTGCCCTCAGGAAGTGGCCTTGGACGAG 
GTTATTTTCACAACTGTCCTGCGACGTTGGCCTGGGCACGTCA^ 

GTCGCGGTCGGGAGTGCGCAGCCAGAG^CGGGGCCAGACGTGCGCCTG^3GGGTGAGGGGAGGCGCCCCX3GGAGGG 
CCTCACAGGAAGTTGGGCTCCCGCACCACCAGGCAGGGCGGGCTCCCGCCGCCGCCGCCGCCACCACCGTCCAGG 
GG CCGGTAGACAAAGTGGAAGTCGCGCTTGGGCTCGCTGCG CAGCAGGTAGCC CTTGATG CAGTGCGGCAGCGCG 
TCGTCCGCCAGCTGGAAGCAGCGCCCGTCCACCAGCACGAACAGCCGGTGCGCCT 
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FIGURE 32 

MCFLNKLLLLAVLGWLFQIPTVPEDLFFLEEGPSYAFEVDTVAPEHGLDNAPWDQQLLYTC 
CPYIGELRKLLASWVSGSSGRSGGFMRKITPTTTTSLGAQPSQTSQGLQAQLAQAFFHNQPP 
SLRRTVEFVAERIGSNCVKHIKATLVADLVRQAESLLQEQLVTQGEEGGDPAQLLEILCSQL 
C PHGAQAIjALGRE f cqrks p GAVRALL P EET paavls S AENI AVGLATE KACAWLS AN I TAL 
IRREVKAAVSRTLRAQGPEPAARGERRGCSRA 

Signal peptide: 

amino acids 1-18 

N-glycosylation site. 

amino acids 244-248 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 89-93 

Casein kinase II phosphorylation site. 

amino acids 21-25, 167-171, 223-227 

N-myristoylation site. 

amino acids 100-106, 172-178, 207-213 

Hicrobodies C- terminal targeting signal. 

amino acids 278-282 
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FIGURE 33 

TCCCTTGACAGGTCTGGTGGCTGGTTCGGGGTCTACTGAAGGCTGTCTTGATCAGGAAACTG 
AAGACTCTCTGCTTTTGCCACAGCAGTTCCTGCAGCTTCCTTGAGGTGTGAACCCACATCCC 
TG CCCCCAGGG CCACCTGCAGGACGCCGACACCTACCCCTCAGCAGACGCCGGAGAGAAATG 
AGTAGCAACAAAGAGCAGCGGTCAGCAGTGTTCG TGATCCTCTTTG CCCTCATCACCATCCT 
CATCCTCTACAGCTCCAACAGTGCCAATGAGGTCTTCCATTACGGCTCCCTGCGGGGCCGTA 
GCCGCCGACCTGTCAACCTCAAGAAGTGGAGCATCACTGACGGCTATGTCCCCATTCTCGGC 
AACAAGACACTGCCCTCTCGGTGCCACCAGTGTGTGATTGTCAGCAGCTCCAGCCACCTGCT 
GGGCACCAAGCTGGGCCCTGAGATCGAGCGGGCTGAGTGTACAATCCGCATGAATGATGCAC 
CCACCACTGGCTACTCAGCTGATGTGGGCAACAAGACCACCTACCGCGTCGTGGCCCATTCC 
AGTGTGTTCCGCGTGCTGAGGAGGCCCCAGGAGTTTGTCAACCGGACCCCTGAAACCGTGTT 
CATCTTCTGGGGGCCCCCGAGCAAGATGCAGAAGCCCCAGGGCAGCCTCGTGCGTGTGATCC 
AGCGAGCGGGCCTGGTGTTCCCCAACATGGAAGCATATGCCGTCTCTCCCGGCCGCATGCGG 
CAATTTGACGACCTCTTCCGGGGTGAGACGGGCAAGGACAGGGAGAAGTCTCATTCGTGGTT 
GAGCACAGGCTGGTTTACCATGGTGATCGCGGTGGAGTTGTGTGACCACGTGCATGTCTATG 
GCATGGTCCCCCCCAACTACTGCAGCCAGCGGCCCCGCCTCCAGCGCATGCCCTACCACTAC 
TACGAGCCCAAGGGGCCGGACGAATGTGTCACCTACATCCAGAATGAGCACAGTCGCAAGGG 
CAACCACCACCGCTTCATCACCGAGAAAAGGGTCTTCTCATCGTGGGCCCAGCTGTATGGCA 
TCACCTTCTCCCACCCCTCCTGGACCTAGGCCACCCAGCCTGTGGGACCTCAGGAGGGTCAG 
AGGAGAAGCAGCCTCCGCCCAGCCGCTAGGCCAGGGACCATCTTCTGGCCAATCAAGGCTTG 
CTGGAGTGTCTCCCAGCCAATCAGGGCCTTGAGGAGGATGTATCCTCCAGCCAATCAGGGCC 
TGGGGAATCTGTTGGCGAATCAGGGATTTGGGAGTCTATGTGGTTAATCAGGGGTGTCTTTC 
TTGTGCAGTCAGGGTCTGCGCACAGTCAATCAGGGTAGAGGGGGTATTTCTGAGTCAATCTG 
AGGCTAAGGACATGTCCTTTCCCATGAGGCCTTGGTTCAGAGCCCCAGGAATGGACCCCCCA 
ATCACTCCCCACTCTGCTGGGATAATGGGGTCCTGTCCCAAGGAGCTGGGAACTTGGTGTTG 
CCCCCTCAATTTCCAG CACCAGAAAGAGAGATTGTGTGGGGGTAGAAG CTGTCTGGAGGCC C 
GGCCAGAGAATTTGTGGGGTTGTGGAGGTTGTGGGGGCGGTGGGGAGGTCCCAGAGGTGGGA 
GGCTGGCATCCAGGTCTTGGCTCTGCCCTGAGACCTTGGACAAACCCTTCCCCCTCTCTGGG 
CACCCTTCTGCCCACACCAGTTTCCAGTGCGGAGTCTGAGACCCTTTCCACCTCCCCTACAA 
GTGCCCTCGGGTCTGTCCTCCCCGTCTGGACCCTCCCAGCCACTATCCCTTGCTGGAAGGCT 
CAGCTCTTTGGGGGGTCTGGGGTGACCTCCCCACCTCCTGGAAAACTTTAGGGTATTTTTGC 
GCAAACTCCTTCAGGGTTGGGGGACTCTGAAGGAAACGGGACAAAACCTTAAGCTGTTTTCT 
TAGCCCCTCAGCCAGCTGCCATTAGCTTGGCTCTTAAAGGGCCAGGCCTCCTTTTCTGCCCT 
CTAGCAGGGAGGTTTTCCAACTGTTGGAGGCGCCTTTGGGGCTGCCCCTTTGTCTGGAGTCA 
CTGGGGGCTTCCGAGGGTCTCCCTCGACCCTCTGTCGTCCTGGGATGGCTGTCGGGAGCTGT 
ATCACCTGGGTTCTGTCCCCTGGCTCTGTATCAGGCACTTTATTAAAGCTGGGCCTCAGTGG 
GGTGTGTTTGTCTCCTGCTCTTCTGGAGCCTGGAAGGAAAGGGCTTCAGGAGGAGGCTGTGA 
GGCTGGAGGGACCAGATGGAGGAGGCCAGCAGCTAGCCATTGCACACTGGGGTGATGGGTGG 
GGGCGGTGACTGCCCCAGACTTGGTTTTGTAATGATTTGTACAGGAATAAACACACCTACGC 
TCCGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 34 



MS SNKEQRS AVFVILFAL I TI L I LYS SNS ANE VFHYGSLRGR SRRPVNLKKW S ITDGYVP I L 
GNKTLPSRCHQCVIVSSSSHLLGTKLGPEIERAECTIRMNDAPTTGYSADVGNKTTYRWAH 
SSVFRVLRRPQEFVNRTPETVFIFWGPPSKMQKPQGSLVRVIQRAGLVFPNMEAYAVSPGRM 
RQFDDLFRGETGKDREKSHSWLSTGWFTMVIAVELCDHVHVYGMVPPNYCSQRPRLQRMPYH 
YYEPKGPDECVTYIQNEHSRKGNHHRFITEKRVFSSWAQLYGITFSHPSWT 

Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 9-31 (type II) 

N-glycosylation site. 

amino acids 64-68, 115-119 

cAMP- and cGMP- dependent protein kinase phosphorylation site, 
amino acids 50-54 

Casein kinase II phosphorylation site. 

amino acids 3-7, 29-33, 53-57, 197-201 

Tyrosine kinase phosphorylation site. 

amino acids 253-262 

N-myristoylation site. 

amino acids 37-43, 114-120, 290-294 
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FIGURE 35 

GTTTCTCATAGTTGGCGTCTTCTAAAGGAAAAACACTAAAATGAGGAACTCAGCGGACCGGGAGCGACGCAGCTT 
GAGGGAAG CATCCCTAGCTG TTGG CGCAGAGGGGCGAGGCTGAAGCCGAG TGGCCCG AGGTG TCTGAGGGGCTGG 
GGCAAAGGTGAAAGAGTTTCAGAACAAGCTTCCTGGAACCCATGACCCATGAAGTCTTGTCGACATTTATACCGT 
CTGAGGGTAGCAGCTCGAAACTAGAAGAAGTGGAGTGTTGCCAGGGACGGCAGTATCTCTTTGTGTGACCCTGGC 
GGCCTATGGGACGTTGGCTTCAGACCTTTGTGATACACCATCCTGCGTGGGACGATGACGGCGTGGAGAGGAATG 
AGGCCTGAGGTCACACTGGCTTGCCTCCTCCTAGCCACAGCAGGCTGCTTTGCTGACTTGAACGAGGTCCCTCAG 
GTCACCGTCCAGCCTGCGTCCACCGTCCAGAAGCCCGGAGGCACTGTGATCTTGGGCTGCGTGGTGGAACCTCCA 
AGGATGAATGTAACCTGGCGCCTGAATGGAAAGGAGCTGAATGGCTCGGATGATGCTCTGGGTGTCCTCATCA.ee 
CACGGGACCCTCGTCATCACTGCCCTTAAa^ACCACACTGTGGGACGGTACCAGTGTGTGGCCCGGATGCCTGCG 
GGGGCTGTGGCCAGCGTGCCAGCCACTGTGACACTAGCCAATCTCCAGGACTTCAAGTTAGATGTGCAGCACGTG 
ATTGAAGTGGATGAGGGAAACACAGCAGTCATTGCCTGCCACCTGCCTGAGAGCCACCCCAAAGCCCAGGTCCGG 
TACAG CGTCAAACAAG AGTGGCTGGAGGCCT CCAGAGGT AACTAC CTGATCATGCCCTCAGGGAACCTCC AGATT 
GTGAATGCCAGCCAGGAGGACGAGGGCATGTACAAGTGTGCAGCCTACAACCCAGTGACCCAGGAAGTGAAAACC 
TCCGGCTCCAGCGACAGGCTACGTGTGCGCCGCTCCACCGCTGAGGCTGCCCGCATCATCTACCCCCCAGAGGCC 
CAAACCAT CATCGTCACCAAAGGCCAGAGTCTCATTCTGGAGTGTGTGGCCAGTGGAATCCCAC CCCC ACGGGTC 
ACCTGGGCCAAGGATGGGTCCAGTGTCACCGGCTACAACAAGACGCGCTTCCTGCTGAGCAACCTCCTCATCGAC 
ACCACCAG CGAGGAGGAC TC AGGC ACCTACCGCTG CATGGC CGACAATGGGGTTGGGCAGCCCGGGGCAGCGGTC 
ATCCTCTAC7UITGTCCAGGTGTTTGAACCCCCTGAGGTCACCATGGAGCTATCCCAGCTGGTCATCCCCTGGGGC 
CAGAGTGCCAAGCTTACCTGTGAGGTGCGTGGGAACCCCCCGCCCTCCGTGCTGTGGCTGAGGAA7GCTGTGCCC 
CTCATCTCCAGCCAGCGCCTCCGGCTCTCCCGCAGGGCCCTGCGCGTGCTCAGCATGGGGCCTGAGGACGAAGGC 
GTCTACCAGTGCATGGCCGAGAACGAGGTTGGGAGCGCCCATGCCGTAGTCCAGCTGCGGACCTCCAGGCCAAGC 
ATAACCCCAAGGCTATGGCAGGATGCTGAGCTGGCTACTGGCACACCTCCTGTATCACCCTCCAAACTCGGCAAC 
CCTGAGCAGATGCTGAGGGGGCAACCGGCGCTCCCCAGACCCCCAACGTCAGTGGGGCCTGCTTCCCCGAAGTGT 
CCAGGAGAGAAGGGGCAGGGGGCTCCCGCCGAGGCTCCCATCATCCTCAGCTCGCCCCGCACCTCCAAGACAGAC 
TCATATGAACTGGTGTGGCGGCCTCGGCATGAGGGCAGTGGCCGGGCGCCAATCCTCTACTATGTGGTGAAACAC 
CGCAAGCAGGTCACAAATTCCTCTGACGATTGGACCATCTCTGGCATTCCAGCCAACCAGCACCGCCTGACCCTC 
AC CAGACTTG ACCCCGGGAGCTTGTATGAAG TGGAGATGGCAGCTTACAACTGTGCGGGAGAGGGC CAGACAGCC 
ATGGTCACCTTCCGAACTGGACGGCGGCCCAAACCCGAGATCATGGCCAGCAAAGAGCAGCAGATCCAGAGAGAC 
GACCCTGGAGCCAGTCCCCAGAGCAGCAGCCAGCCAGACCACGGCCGCCTCTCCCCCCCAGAAGCTCCCGACAGG 
CCCACCATCTCCACGGCCTCCGAGACCTCAGTGTACGTGACCTGGATTCCCCGTGGGAATGGTGGGTTCCCAATC 
CAGTCCTTCCGTGTGGAGTACAAGAAGCTAAAGAAAGTGGGAGACTGGATTCTGGCCACCAGCGCCATCCCCCCA 
TCGCGGCTGTCCGTGGAGATCACGGGCCTAGAGAAAGGCACCTCCTACAAGTTTCGAGTCCGGGCTCTGAACATG 
CTGGGGGAGAGCGAGCCCAGCGCCCCCTCTCGGCCCTACGTGGTGTCGGGCTACAGCGGTCGCGTGTACGAGAGG 
CCCGTGGCAGGTCCTTATATCACCTTCACGGATGCGGTCAATGAGACC^ 

CCAGCAAGTAACAACAACACCCCAATCCATGGCTTTTATATCTATTATCGACCCACAGACAGTGACAATGATAGT 
GACTACAAG AAGGAT ATGGTGGAAG GGG ACAAGTACTGGCACTCCATCAGCCACCTGCAGCCAGAGAC CTC CTAC 
G ACATTAAGATGCAGTG CTTCAATGAAG G AGGGG AG AGCGAGTTCAGCAACGTGATGATCTGTGAGACCAAAGCT 
CGGAAGTCTTCTGGCCAGCCTGGTCGACTGCCACCCCCAACTCTGGCCCCACCACAGCCGCCCCTTCCTGAAACC 
ATAGAGCGGCCGGTGGGCACTGGGGCCATGGTGGCTCGCTCCAGCGACCTGCCCTATCTGATTGTCGGGGTCGTC 
CTGGGCTCGATCGTTCTCATCATCGTCACCTTCATCCCCTTCTGCTTGTG^^ 

ACAACAGACCTGGGTTTTCCTCGAAGTGCCCTTCCACCCTCCTGCCCGTATACTATGGTGCCATTGGGAGGACTC 

CCAGGCC^CCAGGCCAGTGGACAGCCCTACCTCAGTGGCATCAGTGGACGGGCCTGTGCTAATGGGATCCACATG 

AATAGGGGCTGCCCCrCGGCrGCAGTGGGCTACCCGGGC^TGAAGCCCCAGC^GCACn'GCCCAGGCGAGCTT 

CAGCAGAGTGACACCAGC^GCCrrGCTGAGGC^GACCCATCTTGGCMTGGATATGACCCCCAAAGTCACCAGATC 

ACGAGGGGTCCC^GTCTAGCCCGGACGAGGGCTCTTTCTTATACACaCrrGCCCGACGACTCCACTCACCAGCTG 

CTGCAGCCCCAT(^CGACTGCTGCCAACGCCAGGAGCAGCCTGCTGCTGTGGGCC^GTCAGGGGTGAGGAGAGCC 

CCCGACAGTCCTGTCCTGGAAGCAGTGTGGGACCCTCCATTTCACTCAGGGCCCCCATGCTGCTTGGGCCTTGTG 

CCAGTTGAAGAGGTGGACAGTCCTGACTCCTGCCAAGTGAGTGGAGGAGACTGGTGTCCCCAGCACCCCGTAGGG 

GCCTACGTAGGACAGGAACCTGGAATGCAGCTCTCCCCGGGGCCACITGGTGCGTGTGTCrrTTTGAAACACCACCT 

CTC^CT^TTTAGGCAGAAGCTGATATCCCTVGAAAGACTATATATTGTTTT^ 

AGAGAC^GAGAAAATTGGTATTTATTTTTCTATTATAGCCATATTTATATATTTATGCACTTGTAAATAAATGTA 

TATGTTTTATAATTCTGGAGAGACATAAGGAGTCCTACCCGTTGAGGTTGGAGAGGGAAAATAAAGAAGCTGCCA 

CCTAAC^GGAGTCACCCAGGAAAGCACCGCACAGGCTGGCGCGGGACAGACTCCTAACCTGGGGCCTCTGCAGTG 

GCAGGCGAGGCTGCAGGAGGCCCACAGATAAGCTGGCAAGAGGAAGGATCCCAGGCACATGGTTCATCACGAGCA 

TGAGGGAACAGCAAGGGGCACGGTATCACAGCCTGGAGACACCCACAC^GATGGCTGGATCCGGTGCTACGGGAA 

ACATTTTCCTAAGATGCCCATGAGAACAGACCAAGATGTGTACAGCACTATGAG 

CAATAATCCGTGGCAACATATCrTCTGTAAAAACAAACACT 
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FIGURE 36 

MLRGTMTAWRGMRPEVTLACLLLATAGC FADLNE VP Q VTVQP AS TVQKPGGTVI LG CWE P P 
RMNVTWRLNGKELNGSDDALGVLITHGTLVITALNNHTVGRYQCVARMPAGAVASVPATVTL 
ANLQDFKLDVQHVIEVDEGNTAVIACHLPESHPKAQVRYSVKQEWLEASRGNYLIMPSGNLQ 
I VNASQEDEGMYKCAAYNPVTQEVKTSGSSDRLRVRRSTAEAARI I YPPEAQTI IVTKGQSL 
ILECVASGIPPPRVTWAKDGSSVTGYNKTRFLLSNLLIDTTSEEDSGTYRCMADNGVGQPGA 
AVILYNVQVFEPPEVTMELSQLVIPWGQSAKLTCEVRGNPPPSVLWLRNAVPLISSQRLRLS 
RRALRVLSMGPEDEGVYQCMAENEVGSAHAWQLRTSRPSITPRLWQDAELATGTPPVSPSK 
LGNPEQMLRGQPALPRPPTSVGPASPKCPGEKGQGAPAEAPIILSSPRTSKTDSYELVWRPR 
HEGSGRAP I LYYWKHRKQVTNSSDDWTI SGI PANQHRLTLTRLDPGSLYEVEMAAYNCAGE 
GQTAMVTFRTGRRPKPEIMASKEQQIQRDDPGASPQSSSQPDHGRLSPPEAPDRPTISTASE 
TSVYVTWIPRGNGGFPIQSFRVEYKKLKKVGDWILATSAIPPSRLSVEITGLEKGTSYKFRV 
RALNMLGESEPSAPSRPYWSGYSGRVYERPVAGPYITFTDAVNETTIMLKWMYIPASNNNT 
PIHGFYIYYRPTDSDNDSDYKKDMVEGDKYWHSISHLQPETSYDIKMQCFNEGGESEFSNVM 
ICETKARKSSGQPGRLPPPTLAPPQPPLPETIERPVGTGAMVARSSDLPYLIVGWLGSIVL 
IIVTFIPFCLWRAWSKQKHTTDLGFPRSALPPSCPYTMVPLGGLPGHQASGQPYLSGISGRA 
CANGIHMNRGCPSAAVGYPGMKPQQHCPGELQQQSDTSSLLRQTHLGNGYDPQSHQITRGPK 
SSPDEGSFLYTLPDDSTHQLLQPHHDCCQRQEQPAAVGQSGVRRAPDSPVLEAVWDPPFHSG 
PPCCLGLVPVEEVDSPDSCQVSGGDWCPQHPVGAYVGQEPGMQLSPGPLVRVSFETPPLTI 

Signal peptide: 

amino acids 1-30 

Transmembrane domain : 

amino acids 16-30 (type II), 854-879 
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FIGURE 37 

CGGGAGGCTGGGTCGTCATGATCCGGACCCCATTGTCGGCCTCTGCCCATCGCCTGCTCCTC 
CCAGGCTCCCGCGGCCGACCCCCGCGCAAC ATG CAGCCCACGGGCCGCGAGGGTTCCCGCGC 
GCTCAGCCGGGGGTATCTGCGGCGTCTGCTGCTCCTGCTACTGCTGCTGCTGCTGCGGCAGC 
CCGTAACCCGCGCGGAGACCACGCCGGGCGCCCCCAGAGCCCTCTCCACGCTGGGCTCCCCC 
AGCCTCTTCACCACGCCGGGTGTCCCCAGCGCCCTCACTACCCCAGGCCTCACTACGCCAGG 
CACCCCCAAAACCCTGGACCTTCGGGGTCGCGCGCAGGCCCTGATGCGGAGTTTCCCACTCG 
TGGACGGCCACAATGACCTGCCCCAGGTCCTGAGACAGCGTTACAAGAATGTGCTTCAGGAT 
GTTAACCTGCGAAATTTCAGCCATGGTCAGACCAGCCTGGACAGGCTTAGAGACGGCCTCGT 
GGGTGCCCAGTTCTGGTCAGCCTCCGTCTCATGCCAGTCCCAGGACCAGACTGCCGTGCGCC 
TCGCCCTGGAGCAGATTGACCTCATTCACCGCATGTGTGCCTCCTACTCTGAACTCGAGCTT 
GTGACCTCAGCTGAAGGTCTGAACAGCTCTCAAAAGCTGGCCTGCCTCATTGGCGTGNAGGG 
TGGTCACTCACTGGACAGCAGCCTCTCTGTGCTGCGCAGTTTCTATGTGCTGGGGGTGCGCT 
ACCTGACACTTACCTTCACCTGCAGTACACCATGGGCAGAGAGTTCCACCAAGTTCAGACAC 
CACATGTACACCAACGTCAGCGGATTGACAAGCTTTGGTGAGAAAGTAGTAGAGGAGTTGAA 
CCGCCTGGGCATGATGATAGATTTGTCCTATGCATCGGACACCTTGATAAGAAGGGTCCTGG 
AAGTGTCTCAGGCTCCTGTGATCTTCTCCCACTCAGCTGCCAGAGCTGTGTGTGACAATTTG 
TTGAATGTTCCCGATGATATCCTGCAGCTTCTGAAGAACGGTGGCATCGTGATGGTGACACT 
GTCCATGGGGGTGCTGCAGTGCAACCTGCTTGCTAACGTGTCCACTGTGGCAGATCACTTTG 
ACCACATCAGGGCAGTCATTGGATCTGAGTTCATCGGGATTGGTGGAAATTATGACGGGACT 
GGCCGGTTCCCTCAGGGGCTGGAGGATGTGTCCACATACCCAGTCCTGATAGAGGAGTTGCT 
GAGTCGTASCTGGAGCGAGGAAGAGCTTCAAGGTGTCCTTCGTGGAAACCTGCTGCGGGTCT 
TCAGACAAGTGGAAAAGGTGAGAGAGGAGAGCAGGGCGCAGAGCCCCGTGGAGGCTGAGTTT 
CCATATGGGCAACTGAGCACATCCTGCCACTCCCACCTCGTGCCTCAGAATGGACACCAGGC 
TACTCATCTGGAGGTGACCAAGCAGCCAACCAATCGGGTCCCCTGGAGGTCCTCAAATGCCT 
CCCCATACCTTGTTCCAGGCCTTGTGGCTGCTGCCACCATCCCAACCTTCACCCAGTGGCTC 
TGCTTGACACAGTCGGTCCCCGCAGAGGTCACTGTGGCAAAGCCTCACAAAGCCCCCTCTCCT 
AGTTCATTCACAAGCATATGCTGAGAATAAACATGTTACACATGGAAAA 
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FIGURE 38 

></usr/segdb2/sst/DNA/Dnaseqs.min/ss.DNA59817 
xsubunit 1 of 1, 487 aa, 1 stop, 2 unknown 
><MW: 53569.32, pi: 7.68, NX(S/T): 5 

MQPTGREGSRALSRRYLRRLLLLLLLLLLRQPVTRAETTPGAPRALSTLGSPSLFTTPGVPS 
ALTTPGLTTPGTPKTLDLRGRAQALMRSFPLVDGHNDLPQVLRQRYKNVLQDVNLRNFSHGQ 
TS LDRLRDGL VG AQ FWS AS VS CQSQD QT AVRLALEQ I DL I HRMCAS YSELE L VTS AEGLNSS 
QKLACLIGVXGGHSLDSSLSVLRSFYVLGVRYLTLTFTCSTPWAESSTKFRHHMYTNVSGLT 
SFGEKVVEELNRLGMMIDLSYASDTLIRRVLEVSQAPVIFSHSAARAVCDNLLNVPDDILQL 
LKNGGIVMVTLSMGVLQCNLLANVSTVADHFDHIRAVIGSEFIGIGGNYDGTGRFPQGLEDV 
STYPVLIEELLSRXWSEEELQGVLRGNLLRVFRQVEKVREESRAQSPVEAEFPYGQLSTSCH 
SHLVPQNGHQATHLEVTKQPTNRVPWRSSNASPYLVPGLVAAATIPTFTQWLC 

Important features of the protein: 
Signal peptide: 
amino acids 1-36 

Transmembrane domain: 

amino acids 313-331 

N-glycosylation sites. 

amino acids 119-122, 184-187, 243-246 and 333-336 
N-myristoylation sites. 

amino acids 41-46, 59-64, 73-78, 133-138, 182-187, 194-199, 324- 
329, 354-359, 357-362, 394-399, 427-432 and 472-477. 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 136-146 
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FTGTJRE 39 

TGCTAGGCTCTGTCCCACAATGCACCCGAGAGCAGGAGCTGAAAGCCTCTAACACCCACAGA 
TCCCTCTATGACTGCAATGTGAGGTGTCCGGCTTTGCTGGCCCAGCAAGCCTGATAAGCATG 
AAGCTCTTATCTTTGGTGGCTGTGGTCGGGTGTTTGCTGGTGCCCCCAGCTGAAGCCAACAA 
GAGTTCTGAAGATATCCGGTGCAAATGCATCTGTCCACCTTATAGAAACATCAGTGGGCACA 
TTTACAACCAGAATGTATCCCAGAAGGACTGCAACTGCCTGCACGTGGTGGAGCCCATGCCA 
GTGCCTGGCCATGACGTGGAGGCCTACTGCCTGCTGTGCGAGTGCAGGTACGAGGAGCGCAG 
CACCACCACCATCAAGGTCATCATTGTCATCTACCTGTCCGTGGTGGGTGCCCTGTTGCTCT 
ACATGGCCTTCCTGATGCTGGTGGACCCTCTGATCCGAAAGCCGGATGCATACACTGAGCAA 
CTGCACAATGAGGAGGAGAATGAGGATGCTCGCTCTATGGCAGCAGCTGCTGCATCCCTCGG 
GGGACCCCGAGCAAACACAGTCCTGGAGCGTGTGGAAGGTGCCCAGCAGCGGTGGAAGCTGC 
AGGTGCAGGAGCAGCGGAAGACAGTCTTCGATCGGCACAAGATGCTCAGCTAGATGGGCTGG 
TGTGGTTGGGTCAAGGCCCCAACACCATGGCTGCCAGCTTCCAGGCTGGACAAAGCAGGGGG 
CTACTTCTCCCTTCCCTCGGTTCCAGTCTTCCCTTTAAAAGCCTGTGGCATTTTTCCTCCTT 
CTCCCTAACTTTAGAAATGTTGTACTTGGCTATTTTGATTAGGGAAGAGGGATGTGGTCTCT 
GATCTCTGTTGTCTTCTTGGGTCTTTGGGGTTGAAGGGAGGGGGAAGGCAGGCCAGAAGGGA 
ATGGAGACATTCGAGGCGGCCTCAGGAGTGGATGCGATCTGTCTCTCCTGGCTCCACTCTTG 
CCGCCTTCCAGCTCTGAGTCTTGGGAATGTTGTTACCCTTGGAAGATAAAGCTGGGTCTTCA 
GGAACTCAGTGTCTGGGAGGAAAGCATGGCCCAGCATTCAGCATGTGTTCCTTTCTGCAGTG 
GTTCTTATCACCACCTCCCTCCCAGCCCCGGCGCCTCAGCCCCAGCCCCAGCTCCAGCCCTG 
AGGACAGCTCTGATGGGAGAGCTGGGCCCCCTGAGCCCACTGGGTCTTCAGGGTGCACTGGA 
AGCTGGTGTTCGCTGTCCCCTGTGCACTTCTCGCACTGGGGCATGGAGTGCCCATGCATACT 
CTGCTGCCGGTCCCCTCACCTGCACTTGAGGGGTCTGGGCAGTCCCTCCTCTCCCCAGTGTC 
CACAGTCACTGAGCCAGACGGTCGGTTGGAACATGAGACTCGAGGCTGAGCGTGGATCTGAA 
CACCACAGCCCCTGTACTTGGGTTGCCTCTTGTCCCTGAACTTCGTTGTACCAGTGCATGGA 
GAGAAAATTTTGTCCTCTTGTCTTAGAGTTGTGTGTAAATCAAGGAAGCCATCATTAAATTG 
TTTTATTTCTCTCA 
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FIGURE 40 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA60278 
<subunit 1 of 1, 183 aa, 1 stop 
<MW: 20574, pi: 6.60, NX{S/T): 3 

MKLLSLVAWGCLLVPPAEANKSSEDIRCKCICPPYRNISGHIYNQNVSQKDCNCLHWEPM 
PVPGHDVEAYCLLCECRYEERSTTTIKVIIVIYLSWGALLLYMAFLMLVDPLIRKPDAYTE 
QLHNEEENEDARS^^AAASLGGPRANTVLERVEGAQQRWKLQVQEQRKTVFDRHKMLS 

Important features: 
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 90-112 

N-glycosylation sites. 

amino acids 21-24, 38-41 and 47-50 
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FIGURE 41 

AGCGGGTCTCGCTTGGGTTCCGCTAATTTCTGTCCTGAGGCGTGAGACTGAGTTCATAGGGTCCTGGGTCCCCGA 
ACCAGGAAGGGTTGAGGGAACACAATCTGCAAGCCCCCGCGACCCAAGTGAGGGGCCCCGTGTTGGGGTCCTCCC 
TCCCTTTGC^TTCCCACCCCTCCGGGCTTTGCGTCTTCCTGGGGACCCCCTCGCCGGGAGATGGCCGCGTTGATG 
CGGAGCAAGGATTCGTCCTGCTGCCTGCTCCTACTGGCCGCGGTGCTGATGGTGGAGAGCTCACAGATCGGCAGT 
TCGCGGGCCAAACTCAACTCCATCAAGTCCTCTCTGGGCGGGGAGACGCCTGGTCAGGCCGCCAATCGATCTGCG 
GGCATGTACCAAGGACTGGCATTCGGCGGCAGTAAGAAGGGCAAAAACCTGGGGCAGGCCTACCCTTGTAGCAGT 
GATAAGGAGTGTGAAGTTGGGAGGTATTGCCACAGTCCCCACCAAGGATCATCGGCCTGCATGGTGTGTCGGAGA 
AAAAAGAAGCGCTGCCACCGAGATGGC^TGTGCTGCCCCAGTACCCGCTGC^TAATGGCATCTGTATCCCAGTT 
ACTGAAAGCATCTTAACCCCTCACATCCCGGCTCTGGATGGTACTCGGCACAGAGATCGAAACCACGGTCATTAC 
TC7LAACCATGACTTGGGATGGCAGAATCH71GGAA 

GACCCCTGCCTACGATCATCAGACTGCATTGAAGGGTTTTGCTGTGCTCGTCATTTCTGGACCAAAATCTGCAAA 
CCAGTGCTCCATCAGGGGGAAGTCTGTACCAAAGF^ 

TGCGACTGTGCGAAGGGCCTGTCTTGCAAAGTATGGAAAGATGCCACCTACTCCTCCAAAGCCAGACTCCATGTG 
TGTCAGAAAATTTGATCACCATTGAGGAACATCATC^ 

CATGGTGGAAAATAAGGTTCAGATGCAGAAGAATGGCTAAAATAAGAAACGTGATAAGAATATAGATGATCACAA 
AAAGGGAGAAAGAAAACATGAACTGAATAGATTAGAATGGGTGACAAATGC^GTGC^GCCAGTGTTTCCATTATG 
C AACTTGTCTATGTAAATAATGTAC ACATTTGTGGAAAATGCTATT AT TAAG AGAACAAG CAC ACAGTGGAAATT 
ACTGATGAG TAGCATG TGACTTTCCAAGAGTTTAGGTTG TGCTGGAGG AGAGGTTTCCTTCAG ATTG CTG ATTGC 
TTATACAAATAACCTACATGCCAGATTTCTATTCAACGTTAGAGTTTAACAAAATACTCCTAGAATAACTTGTTA 
TACAATAGGTTCTAAAAATAAAATTGCTAAACAAGAAATGAAAACATGGAGCATTGTTAATTTACAACAGAAAAT 
TACCTTTTGATTTGTAACACTACTTCTGCTGTTCAATCAAGAGTCTTGGTAGATAAGAAAAAAATCAGTCAATAT 
TTCCAAATAATTGCAAAATAATGGCCAGTTGTTTAGGAAGGCCTTTAGGAAGACAAATAAATAACAAACAAACAG 
CCACAAATACTTTTTTTTCAAAATTTTAGTTTTACCT 

TTCAGATTCTACGGAATGACAGTATATCTCTCTTTATCCTATGTGATTCCTGCTCTGAATGCATTATATTTTCCA 
AACTATACCCATAAATTGTGACTAGTAAAATACTTACACAGA 

G ATGTC CAAT ATATGTGGGAAAAGAGCTAACAGAGAGATCATTATTTCTTAAAGATTGG CCATAACCTATATTT^ 

GATAGAATTAGATTGGTAAATACATGTATTCATACATACTCTGTGGTAATAGAGACTTAAGCTGGATCTGTACTG 

CACTGGAGTAAGCAAGAAAATTGGGAAAACTTTTTCGTTTGTTC^GGTTTTGGCAACACATAGATCATATGTCTG 

AGGCACAAGTTGGCTGTTOITCTTTGAAACCAGGGGATGCACAGTCTAAATGAATATCTGCATGGGATTTGCTAT 

CATAATATTTACTATGCAGATGAATTCAGTGTGAGGTCCTGTGTCCGTACTATCCTCAAATTATTTATTTTATAG 

TGCTGAGATCCTCAAATAATCTCAATTTCAGGAGGTTTCACAAAATGTACTCCTGAAGTAGACAGAGTAGTGAGG 

TTTCATTGCCCTCTATAAGCTTCTGACTAGCCAATGGCATCATCCAATTTTCTTCCCAAACCT 

CTTTATTGCCAAAGGGCTAGTTTCGGTTTTCTGCAGCCATTGCGGTTAAAAAATATAAGTAGGATAACTTGTA 

ACCTGCATATTGCTAATCTATAGACACCACAGTTTCTAAATTCTTTGAAACCACTTTACTACTTTTTTTAAACTT 

AACTCAGTTCTAAATACTTTGTCTGGAGCACAAAACAATAAAAGGTTATCTTATAGTCGTGACTTTAAACTTTTG 

TAGACGACAATTCACTTTTTAGTTTTCTTTT^ 

AGATTGAGTTTGAGCCTGTATATCTATTAAAAATTTCAACTTCCCACATATATTTACTAAGATGATTAAGAOT 
CATTTTCTGCACAGGTCTGCAAAAACAAAAATTAT^^ 

TGCTATAAGCTTGTGAAATGAAAATGGAACATTTCAATCAAACATTTCCTATATAACAAT^ 

TTGGTTTCTGCAATATTTTTCTTATGTCCACCCTTTTAAAAATTATTATTTG 

TTAATGAGATGTATTTTCTTATAGAGATATTTCTTACAGAAAGCTTTGTAGCAGA^ 

TTTGTAATTTAGGAAAAATGTATAATAAGATAAAATCTATTAAATTTTTCTCCTCTAAAAACTGAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAA 
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FTGIJRE 42 

MAALMRSraSSCCLLLLAAVLMVESSQIGSSRAKLNSIKSSLGGETPGQAANRSAGMYQGLA 
FGGSKKGKNLGQAYPCSSDKECEVGRYCHSPHQGSSACMVCRRKKKRCHRDGMCCPSTRCNN 
GICIPVTESILTPHIPALDGTRHRDRNHGHYSNHDLGWQNLGRPHTKMSHIKGHEGDPCLRS 
SDCIEGFCCARHFWTKICKPVLHQGEVCTKQRKKGSHGLEIFQRCDCAKGLSCKVWKDATYS 
SKARLHVCQKI 

Signal peptide: 

amino acids 1-25 
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FIGURE 44 

MCFKALGI^SVLLRICSFIPLLKSSVLGSGFGELAPPKMANITSSQILDQLKAPSIjGQFTTT 
PSTQQNSTSHPTTTTSWDLKPPTSQSSVLSHLDFKSQPEPSPVLSQLSQRQQHQSQAVTVPP 
PGLESFPSQAKLRESTPGDSPSTVNKLLQLPSTTIENISVSVHQPQPKHIKLAKRRIPPASK 
IPASAVEMPGSADVTGLNVQFGALEFGSEPSLSEFGSAPSSENSNQIPISLYSKSLSEPLNT 
SLSMTSAVQNSTYTTSVITSCSLTSSSLNSASPVAMSSSYDQSSVHNRIPYQSPVSSSESAP 
GT I MNGHGGGR S QQTLDS KYS S KLLL S WLVPTKQRKRI AHVMWKT P VGQWL I R 



Signal peptide: 

amino acids 1-24 
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FIGURE 45 

GCCGAGTGGGACAAAGCCTGGGGCTGGGCGGGGGCCATGGCGCTGCCATCCCGAATCCTGCT 
TTGGAAACTTGTGCTTCTGCAGAGCTCTGCTGTTCTCCTGCACTCAGCGGTGGAGGAGACGG 
ACGCGGGGCTGTACACCTGCAACCTGCACCATCACTACTGCCACCTCTACGAGAGCCTGGCC 
GTCCGCCTGGAGGTCACCGACGGCCCCCCGGCCACCCCCGCCTACTGGGACGGCGAGAAGGA 
GGTGCTGGCGGTGGCGCGCGGCGCACCCGCGCTTCTGACCTGCGTGAACCGCGGGCACGTGT 
GGACCGACCGGCACGTGGAGGAGGCTCAACAGGTGGTGCACTGGGACCGG CAGC CGCCCGGG 
GTCCCGCACGACCGCGCGGACCGCCTGCTGGACCTCTACGCGTCGGGCGAGCGCCGCGCCTA 
CGGGCCCCTTTTTCTGCGCGACCGCGTGGCTGTGGGCGCGGATGCCTTTGAGCGCGGTGACT 
TCTCACTGCGTATCGAGCCGCTGGAGGTCGCCGACGAGGGCACCTACTCCTGCCACCTGCAC 
CACCATTACTGTGGCCTGCACGAACGCCGCGTCTTCCACCTGACGGTCGCCGAACCCCACGC 
GGAGCCGCCCCCCCGGGGCTCTCCGGGCAACGGCTCCAGCCACAGCGGCGCCCCAGGCCCAG 
ACCCCACACTGGCGCGCGGCCACAACGTCATCAATGTCATCGTCCCCGAGAGCCGAGCCCAC 
TTCTTCCAGCAGCTGGGCTACGTGCTGGCCACGCTGCTGCTCTTCATCCTGCTACTGGTCAC 
TGTCCTCCTGGCCGCCCGCAGGCGCCGCGGAGGCTACGAATACTCGGACCAGAAGTCGGGAA 
AGTCAAAGGGGAAGGATGTTAACTTGGCGGAGTTCGCTGTGGCTGCAGGGGACCAGATGCTT 
TACAGGAGTGAGGACATCCAGCTAGATTACAAAAACAACATCCTGAAGGAGAGGGCGGAGCT 
GGCCCACAGCCCCCTGCCTGCCAAGTACATCGACCTAGACAAAGGGTTCCGGAAGGAGAACT 
GCAAATAGGGAGGCCCTGGGCTCCTGGCTGGGCCAGCAGCTGCACCTCTCCTGTCTGTGCTC 
CTCGGGGCATCTCCTGATGCTCCGGGGCTCACCCCCCTTCCAGCGGCTGGTCCCGCTTTCCT 
GGAATTTGGCCTGGGCGTATGCAGAGGCCGCCTCCACACCCCTCCCCCAGGGGCTTGGTGGC 
AGCATAGCCCCCACCCCTGCGGCCTTTGCTCACGGGTGGCCCTGCCCACCCCTGGCACAACC 
AAAATCCCACTGATGCCCATCATGCCCTCAGACCCTTCTGGGCTCTGCCCGCTGGGGGCCTG 
AAGACATTCCTGGAGGACACTC CCAT CAGAACCTGGCAGCCCCAAAACTGGGGT CAGCCTCA 
GGGCAGGAGTCCCACTCCTCCAGGGCTCTGCTCGTCCGGGGCTGGGAGATGTTCCTGGAGGA 
GGACACTCCCATCAGAACTTGGCAGCCTTGAAGTTGGGGTCAGCCTCGGCAGGAGTCCCACT 
CCTCCTGGGGTGCTGCCTGCCACCAAGAGCTCCCCCACCTGTACCACCATGTGGGACTCCAG 
GCACCATCTGTTCTCCCCAGGGACCTGCTGACTTGAATGCCAGCCCTTGCTCCTCTGTGTTG 
CTTTGGGCCACCTGGGGCTGCACCCCCTGCCCTTTCTCTGCCCCATCCCTACCCTAGCCTTG 
CTCTCAGCCACCTTGATAGTCACTGGGCTCCCTGTGACTTCTGACCCTGACACCCCTCCCTT 
GGACTCTGCCTGGGCTGGAGTCTAGGGCTGGGGCTACATTTGGCTTCTGTACTGGCTGAGGA 
CAGGGGAGGGAGTGAAGTTGGTTTGGGGTGGCCTGTGTTGCCACTCTCAGCACCCCACATTT 

GCATCTGCTGGTGGACCTGCCACCATCACAATAAAGTCCCCATCTGATTTTTAAAAAAAAAA 
AAAAA 
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FIGURE 46 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA60618 
oubunit 1 of l, 341 aa, 1 stop 
<MW: 38070, pi: 6.88, NX(S/T): 1 

MALPSRILLWKLVLLQSSAVLLHSAVEETDAGLYTCNLHHHYCHLYESLAVRLEVTDGPPAT 
PAYWDGEKEVLAVARGAPALLTCVNRGHVWTDRHVEEAQQVVHWDRQPPGVPHDRADRLLDL 
YASGERRAYGPLFLRDRVAVGADAFERGDFSLRIEPLEVADEGTYSCHLHHHYCGLHERRVF 
HLTVAEPHAEPPPRGSPGNGSSHSGAPGPDPTLARGHNVINVIVPESRAHFFQQLGYVLATL 
LLFILLLVTVLLAARRRRGGYEYSDQKSGKSKGKDVNLAEFAVAAGDQMLYRSEDIQLDYKN 
NILKERAELAHSPLPAKYIDLDKGFRKENCK 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 237-262 

N-glycosylation site. 

amino acids 205-208 

Cell attachment sequence. 

amino acids 151-154 

Coproporphyrinogen III oxidase proteins. 

amino acids 115-140 



tt /MO 
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FTGIJRE 47 

CGCCGGAGGCAGCGGCGGCGTGGCGCAGCGGCGACATGGCCGTTGTCTCAGAGGACGACTTT 
CAGCACAGTTCAAACTCCACCTACGGAACCACAAGCAGCAGTCTCCGAGCTGACCAGGAGGC 
ACTGCTTGAGAAGCTGCTGGACCGCCCGCCCCCTGGCCTGCAGAGGCCCGAGGACCGCTTCT 
GTGGCACATACATCATCTTCTTCAGCCTGGGCATTGGCAGTCTACTGCCATGGAACTTCTTT 
ATCACTGCCAAGGAGTACTGGATGTTCAAACTCCGCAACTCCTCCAGCCCAGCCACCGGGGA 
GGACCCTGAGGGCTCAGACATCCTGAACTACTTTGAGAGCTACCTTGCCGTTGCCTCCACCG 
TGCCCTCCATGCTGTGCCTGGTGGCCAACTTCCTGCTTGTCAACAGGGTTGCAGTCCACATC 
CGTGTCCTGGCCTCACTGACGGTCATCCTGGCCATCTTCATGGTGATAACTGCACTGGTGAA 
GGTGGACACTTCCTCCTGGACCCGTGGTTTTTTTGCGGTCACCATTGTCTGCATGGTGATCC 
TCAGCGGTGCCTCCACTGTCTTCAGCAGCAGCATCTACGGCATGACCGGCTCCTTTCCTATG 
AGGAACTCCCAAGCACTGATATCAGGAGGAGCCATGGGCGGGACGGTCAGCGCCGTGGCCTC 
ATTGGTGGACTTGGCTGCATCCAGTGATGTGAGGAACAGCGCCCTGGCCTTCTTCCTGACGG 
CCACCATCTTCCTCGTGCTCTGCATGGGACTCTACCTGCTGCTGTCCAGGCTGGAGTATGCC 
AGGTACTACATGAGGCCTGTTCTTGCGGCCCATGTGTTTTCTGGTGAAGAGGAGCTTCCCCA 
GGACTCCCTCAGTGCCCCTTCGGTGGCCTCCAGATTCATTGATTCCCACACACCCCCTCTCC 
GCCCCATCCTGAAGAAGACGGCCAGCCTGGGCTTCTGTGTCACCTACGTCTTCTTCATCACC 
AGCCTCATCTACCCCGCCGTCTGCACCAACATCGAGTCCCTCAACAAGGGCTCGGGCTCACT 
GTGGACCACCAAGTTTTTCATCCCCCTCACTACCTTCCTCCTGTACAACTTTGCTGACCTAT 
GTGGCCGGCAGCTCACCGCCTGGATCCAGGTGCCAGGGCCCAACAGCAAGGCGCTCCCAGGG 
TTCGTGCTCCTCCGGACCTGCCTCATCCCCCTCTTCGTGCTCTGTAACTACCAGCCCCGCGT 
CCACCTGAAGACTGTGGTCTTCCAGTCCGATGTGTACCCCGCACTCCTCAGCTCCCTGCTGG 
GGCTCAGCAACGGCTACCTCAGCACCCTGGCCCTCCTCTACGGGCCTAAGATTGTGCCCAGG 
GAGCTGGCTGAGGCCACGGGAGTGGTGATGTCCTTTTATGTGTGCTTGGGCTTAACACTGGG 
CTCAGCCTGCTCTAC C CTCC TGGTG CACCT CATCTAGAAGGGAGG ACACAAGGACATTGGTG 
CTTCAGAGCCTTTGAAGATGAGAAGAGAGTGCAGGAGGGCTGGGGGCCATGGAGGAAAGGCC 
TAAAGTTTCACTTGGGGACAGAGAGCAGAGCACACTCGGGCCTCATCCCTCCCAAGATGCCA 
GTGAGCCACGTC CATGCCCATT CCGTGC AAGGCAGATATTCCAGTCATATTAACAGAACACT 
CCTGAGACAGTTGAAGAAGAAATAGCACAAATCAGGGGTACTCCCTTCACAGCTGATGGTTA 
ACATTCCACCTTCTTTCTAGCCCTTCAAAGATGCTGCCAGTGTTCGCCCTAGAGTTATTACA 
AAGCCAGTGCCAAAACCCAGCCATGGGCTCTTTGCAACCTCCCAGCTGCGCTCATTCCAGCT 
GACAGCGAGATGCAAGCAAATGCTCAGCTCTCCTTACCCTGAAGGGGTCTCCCTGGAATGGA 
AGTCCCCTGGCATGGTCAGTCCTCAGGCCCAAGACTCAAGTGTGCACAGACCCCTGTGTTCT 
GCGGGTGAACAACTGCCCACTAACCAGACTGGAAAACCCAGAAAGATGGGCCTTCCATGAAT 
GCTTCATTCCAGAGGGACCAGAGGGCCTCCCTGTGCAAGGGATCAAGCATGTCTGGCCTGGG 
TTTTCAAAAAAAGAGGGATCCTCATGACCTGGTGGTCTATGGCCTGGGTCAAGATGAGGGTC 
TTTCAGTGTTCCTGTTTACAACATGTCAAAGCCATTGGTTCAAGGGCGTAATAAATACTTGC 
GTATTCAAAAA 
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FIGURE 48 

MAWSEDDFQHSSNSTYGTTSSSLRADQEALLEKLLDRPPPGLQRPEDRFCGTYIIFFSLGI 
GSLLPWNFFITAKEYWMFKLRNSSSPATGEDPEGSDILNYFESYLAVASTVPSMLCLVANFL 
LVNRVAVHIRVLASLTVILAIFMVITALVKVDTSSWTRGFFAVTIVCMVILSGASTVFSSSI 
YGMTGSFPMRNSQALISGGAMGGTVSAVASLVDLAASSDVRNSALAFFLTATIFLVLCMGLY 
LLLSRLEYARYYMRPVLAAHVFSGEEELPQDSLSAPSVASRFIDSHTPPLRPILKKTASLGF 
CVTYVFFITSLIYPAVCTNIESLNKGSGSLWTTKFFIPLTTFLLYNFADLCGRQLTAWIQVP 
GPNSKALPGFVLLRTCLIPLFVLCNYQPRVHLKTWFQSDVYPALLSSLLGLSNGYLSTLAL 
LYGPKIVPRELAEATGWMSFYVCLGLTLGSACSTLLVHLI 

Transmembrane domain: 

amino acids 50-74 (type II), 105-127, 135-153, 163-183, 228-252, 
305-330, 448-472 
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FIGURE 49 

GACAGTGGAGGGCAGTGGAGAGGACCGCGCTGTCCTGCTGTCACCAAGAGCTGGAGACACCA 
TCTCCCACCGAGAGTCATGGCCCCATTGGCCCTGCACCTCCTCGTCCTCGTCCCCATCCTCC 
TCAGCCTGGTGGCCTCCCAGGACTGGAAGGCTGAACGCAGCCAAGACCCCTTCGAGAAATGC 
ATGCAGGATCCTGACTATGAGCAGCTGCTCAAGGTGGTGACCTGGGGGCTCAATCGGACCCT 
GAAGCCCCAGAGGGTGATTGTGGTTGGCGCTGGTGTGGCCGGGCTGGTGGCCGCCAAGGTGC 
TCAGCGATGCTGGACACAAGGTCACCATCCTGGAGGCAGATAACAGGATCGGGGGCCGCATC 
TTCACCTACCGGGACCAGAACACGGGCTGGATTGGGGAGCTGGGAGCCATGCGCATGCCCAG 
CTCTCACAGGATCCTCCACAAGCTCTGCCAGGGCCTGGGGCTCAACCTGACCAAGTTCACCC 
AGTACGACAAGAACACGTGGACGGAGGTGCACGAAGTGAAGCTGCGCAACTATGTGGTGGAG 
AAGGTGCCCGAGAAGCTGGGCTACGCCTTGCGTCCCCAGGAAAAGGGCCACTCGCCCGAAGA 
CATCTACCAGATGGCTCTCAACCAGGCCCTCAAAGACCTCAAGGCACTGGGCTGCAGAAAGG 
CGATGAAGAAGTTTGAAAGGCACACGCTCTTGGAATATCTTCTCGGGGAGGGGAACCTGAGC 
CGGCCGGCCGTGCAGCTTCTGGGAGACGTGATGTCCGAGGATGGCTTCTTCTATCTCAGCTT 
CGCCGAGGCCCTCCGGGCCCACAGCTGCCTCAGCGACAGACTCCAGTACAGCCGCATCGTGG 
GTGGCTGGGACCTGCTGCCGCGCGCGCTGCTGAGCTCGCTGTCCGGGCTTGTGCTGTTGAAC 
GCGCCCGTGGTGGCGATGACCCAGGGACCGCACGATGTGCACGTGCAGATCGAGACCTCTCC 
CCCGGCGCGGAATCTGAAGGTGCTGAAGGCCGACGTGGTGCTGCTGACGGCGAGCGGACCGG 
CGGTGAAGCGCATCACCTTCTCGCCGCCGCTGCCCCGCCACATGCAGGAGGCGCTGCGGAGG 
CTGCACTACGTGCCGGCCACCAAGGTGTTCCTAAGCTTCCGCAGGCCCTTCTGGCGCGAGGA 
GCACATTGAAGGCGGCCACTCAAACACCGATCGCCCGTCGCGCATGATTTTCTACCCGCCGC 
CGCGCGAGGGCGCGCTGCTGCTGGCCTCGTACACGTGGTCGGACGCGGCGGCAGCGTTCGCC 
GGCTTGAGCCGGGAAGAGGCGTTGCGCTTGGCGCTCGACGACGTGGCGGCATTGCACGGGCC 
TGTCGTGCGCCAGCTCTGGGACGGCACCGGCGTCGTCAAGCGTTGGGCGGAGGACCAGCACA 
GCCAGGGTGGCTTTGTGGTACAGCCGCCGGCGCTCTGGCAAACCGAAAAGGATGACTGGACG 
GTCCCTTATGGCCGCATCTACTTTGCCGGCGAGCACACCGCCTACCCGCACGGCTGGGTGGA 
GACGGCGGTCAAGTCGGCGCTGCGCGCCGCCATCAAGATCAACAGCCGGAAGGGGCCTGCAT 
CGGACACGGCCAGCCCCGAGGGGCACGCATCTGACATGGAGGGGCAGGGGCATGTGCATGGG 
GTGGCCAGCAGCCCCTCGCATGACCTGGCAAAGGAAGAAGGCAGCCACCCTCCAGTCCAAGG 
CCAGTTATCTCTCCAAAACACGACCCACACGAGGACCTCGCATTAAAGTATTTTCGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 50 

MAPLALHLLVLVPILLSLVASQDWKAERSQDPFEKCMQDPDYEQLLKVVTWGLNRTLKPQRV 
I WGAGVAGLVAAKVLSDAGHKVT I LEADNR I GGRI FTYRDQNTGW I GELGAMRMP S S HR I L 
HKLCQGLGLNLTKFTQYDKNTWTEVHEVKLRlSn^EKVPEKLGYALRPQEKGHSPEDIYQMA 
LNQALKDLKALGCRKAMKKFERHTLLEYLLGEGNLSRPAVQLLGDVMSEDGFFYLSFAEALR 
AHSCLSDRLQYSRIVGGWDLLPRALLSSLSGLVLLNAPWAMTQGPHDVHVQIETSPPARNL 
KVLKADWLLTASGPAVKRITFSPPLPRHMQEALRRLHYVPATKVFLSFRRPFWREEHIEGG 
HSNTDRPSRMIFYPPPREGALLLASYTWSDAAAAFAGLSREEALRLALDDVAALHGPWRQL 
WDGTGWKRWAEDQHSQGGFWQPPALWQTEKDDWTVPYGRIYFAGEHTAYPHGWVETAVKS 
ALRAAIKINSRKGPASDTASPEGHASDMEGQGHVHGVASSPSHDLAKEEGSHPPVQGQLSLQ 
NTTHTRTSH 

Signal peptide: 

amino acids 1-21 
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FIGURE 51 

CTGACATGGCCTGACT CGGG AC AG CTCAGAG CAGGG CAG AACTGGGGACACT CTGGGC CGGCCTTCTG 
CCTGCATGGACGCTCTGAAGCCACCCTGTCTCTGGAGGAACCACGAGCGAGGGAAGAAGGACAGGGAC 
TCGTGTGGCAGGAAGAACTCAGAGCCGGGAAGCCCCCATTCACTAGAAGCACTGAGAGATGCGGCCCC 
CTCGCAGGGTCTGAATTTCCTGCTGCTGTTCACAAAGATGCTTTTTATCTTTAACTTTTTGTTTTCCC 
CACTTCCGACCCCGGCGTTGATCTGCATCCTGACATTTGGAGCTGCCATCTTCTTGTGGCTGATCACC 
AGAC CTCAACCCGTCTTACCTCTTCTTGACCTGAACAAT CAGT CTGTGGGAATTG AGGGAGGAGCACG 
GAAGGGGGTTTCCCAGAAGAACAATGACCTAACAAGTTGCTGCTTCTCAGATGCCAAGACTATGTATG 
AGGTTTTCCAAAGAGGACTCGCTGTGTCTGACAATGGG C CCTGCTTGGG AT ATAGAAAAC CAAACCAG 
CCCTACAGATGGCTATCTTACAAACAGGTGTCTGATAGAGCAGAGTACCTGGGTTCCTGTCTCTTGCA 
T AAAGGTTATAAATCATCACCAGAC CAGTTTGTCGGCAT CTTTGCTCAGAATAGGCCAGAGTGGATCA 
TCTCCGAATTGGCTTGTTACACGTACTCTATGGTAGCTGTACCTCTGTATGACACCTTGGGACCAGAA 
GCCATCGTACAT ATTGTCAACAAGGCTGATATCG C CATGGTGATCTGTGACACACCC CAAAAGG CATT 
GGTG CTGATAGGGAATGTAGAGAAAGG CTTCAC CCCG AGCCTGAAGGTGATC AT CCTT ATGGAC CC CT 
TTGATGATGACCTGAAGCAAAGAGGGGAGAAGAGTGGAATTGAGATCTTATCCCTATATGATGCTGAG 
AACCTAGGCAAAGAGCACTTCAGAAAACCTGTGCCTCCTAGCCCAGAAGACCTGAGCGTCATCTGCTT 
CACC AGT GGGACC AC AGGTG ACC CCAAAGGAGCCATG ATAACCCATCAAAATATTGTTTCAAATG CTG 
CTGCCTTTCTCAAATGTGTGGAGCATGCTTATGAGCCCACTCCTGATGATGTGGCCATATCCTACCTC 
C CTC TGG CT C AT ATGT TTG AGAGG ATTGTACAGG CTGTTGTGTACAG CTGTGGAG CCAGAGTTGG ATT 
CTTCCAAGGGGATATTCGGTTGCTGGCTGACGACATGAAGACTTTGAAGCCCACATTGTTTCCCGCGG 
TGCCT CG ACT C CTT AACAGGAT CT ACGATAAGGT ACAAAATG AGGC C AAG ACACCCTTGAAGAAGTT C 
TTGTTG AAG CTGGCTGTTTCCAGTAAATTCAAAG AG CTTCAAAAGGGTAT CATCAGGCATGATAGTTT 
CTGGGACAAGCTC^TCTTTGCAAAGATCCAGGACAGCCTGGGCGGAAGGGTTCGTGTAATTGTCACTG 
GAGCTG C CC CCATGTC CACTTC AGTC ATGACATT CTTCCGGGCAG CAATGGGATGTCAGGTGTATGAA 
GCTTATGGTCAAACAGAATGCACAGGTGGCTGTACATTTACATTACCTGGGGACTGGACATCAGGTCA 
CGTTGGGGTGCCCCTGGCTTGCAATTACGTGAAGCTGGAAGATGTGGCTGACATGAACTACTTTACAG 
TGAATAATGAAGGAGAGGTCTGCATCAAGGGTACAAACGTGTTCAAAGGATACCTGAAGGACCCTGAG 
AAGACACAGGAAGCCCTGGACAGTGATGGCTGGCTTCACACAGGAGACATTGGTCGCTGGCTCCCGAA 
TGGAACTCTGAAGATCATCG ACCGTAAAAAGAACATTTT CAAGCTGG C C CAAGGAGAAT ACATTGCAC 
C AGAGAAG AT AGAAAATAT CTACAACAGGAGT CAAC CAGTGTT ACAAATTTTTGTACACGGGGAGAG C 
TT ACGGTCATCCTT AGTAGGAGTGGTGGTTCCTGACACAGATGTACTT C C CTCATTTG CAG CCAAGCT 
TGGGGTGAAGGGCTCCTTTGAGGAACTGTGCCAAAACCAAGTTGTAAGGGAAGCCATTTTAGAAGACT 
TGCAGAAAATTGGGAAAGAAAGTGGCCTTAAAACTTTTGAACAGGTCAAAGCCATTTTTCTTCATCCA 
GAG CCATTTT C CATTGAAAATGGGCTCTTGACACCAACATTGAAAGCAAAGCGAGGAG AG CTTT CC AA 
ATACTTTCGGACCCAAATTGACAGCCTGTATGAGCACATCCAGGATTAGGATAAGGTACTTAAGTACC 
TG C CGGC CCACTGTG CACTGCTTGTGAGAAAATGGATTAAAAACTATTCTTACATTTGTTTTGC CTTT 
CCTCCTATTTTTTTTTAACCTGTTAAACTCTAAAGCCATAGCTTTTGTTTTATATTGAGACATATAAT 
GTGT AAACTTAGTTCC CAAATAAATCAATCCTGTCTTTCCCATCTT CGATGT TG CT AAT ATTAAGG CT 
T CAGGGCTACTTTTATC AACATG CC TGTCTT CAAGATCCCAGTTTATGTTCTGTGTCCTTCCTCATGA 
TTTCC AAC CTTAATACTATTAGTAACCACAAGTTCAAGGGTCAAAGGG AC CCTCTGTG CCTT CTTCTT 
TGTTTTGTG AT AAAC AT AACTTG CCAACAGTCT CTATGCTTATTTACATCrTCTACTGTT CAAA 
G AGATTTTTAAATT CTGAAAAACTGCTTACAATTCATGTTTT CTAG CCAC TC CACAAACCACTAAAAT 
TTT AGTT TT AG CCTATC ACTC ATGTCAAT CATATCTATGAGACAAATGTCTCCGATGCT CTTCTGCGT 
AAATT AAATTGTGTACTGAAGGGAAAAGTTTGAT CATACC AAACATTT CCTAAACTCTCTAGTT AGAT 
AT CTGACTTG GGAGT ATTAAAAATTGGGTCTATGACATACTGTCCAAAAGGAATGCTGTTCTTAAAG C 
ATTATTTACAGTAGGAACTGGGGAGTAAAT CTGT TCCCTACAGTTTGCTGCTG AGCTGGAAGCTGTGG 
GGGAAGGAGTTGACAGGTGGGCCCAGTGAACTTTTCCAGTAAATGAAGCAAGC^CTGAATAAAAACCT 
CCTGAACTGGGAACAAAGATCTACAGG C AAGC AAGATG CCCACACAACAGG CTTATTTTCTGTGAAGG 
AAC CAACTGATCTC CCC CAC CCTTGGATTAGAGTTCCTGCTCTACCTT AC CCACAGATAAC ACATGT T 
GTTTCTACTTGTAAATGTAAAGTCTTTAAAATAAACTATTACAGATAAAAAA 
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FTGTJRE 52 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA60775 
<subunit 1 of 1, 739 aa, 1 stop 
<MW: 82263, pi: 7.55 # NX(S/T): 3 

MDALKPP CLWRNHERGKKDRDS CGRKNS E PGS PHS LE ALRDAAP S QGLNFLLLFTKMLF I FN 
FLFSPLPTPALICILTFGAAIFLWLITRPQPVLPLLDLNNQSVGIEGGARKGVSQKNNDLTS 
CC FSD AKTMYE VFQRGLAVS DNGP CLG YRKPNQ PYRWL S YKQ VSDRAE YLGS CLLHKG YKS S 
PDQFVGIFAQNRPEWIISELACYTYSMVAVPLYDTLGPEAIVHIVNKADIAMVICDTPQKAL 
VLIGNVEKGFTPSLKVIILMDPFDDDLKQRGEKSGIEILSLYDAENLGKEHFRKPVPPSPED 
LSVICFTSGTTGDPKGAMITHQNIVSNAAAFLKCVEHAYEPTPDDVAISYLPLAHMFERIVQ 
AWYSCGARVGFFQGDIRLLADDMKTLKPTLFPAVPRLLNRIYDKVQNEAKTPLKKFLLKLA 
VSSKFKELQKGIIRHDSFWDKLIFAKIQDSLGGRVRVIVTGAAPMSTSVMTFFRAAMGCQVY 
EAYGQTECTGGCTFTLPGDWTSGHVGVPLACNYVKLEDVADMNYFTVNNEGEVCIKGTNVFK 
GYLKDPEKTQEALDSDGWLHTGDIGRWLPNGTLKIIDRKKNIFKLAQGEYIAPEKIENIYNR 
SQPVLQIFVHGESLRSSLVGWVPDTDVLPSFAAKLGVKGSFEELCQNQWREAILEDLQKI 
GKESGLKTFEQVKAIFLHPEPFSIENGLLTPTLKAKRGELSKYFRTQIDSLYEHIQD 

Important features: 

Type II transmembrane domain: 

amino acids 61-80 

Putative AMP -binding domain signature. 

amino acids 314-325 

N-glycosylation site. 

amino acids 102-105, 588-591 and 619-622 
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FTGTJRE 53 

GGAGGCGGAGGCCGCGGCGAGCCGGGCCGAGCAGTGAGGGCCCTAGCGGGGCCCGAGCGGGG 

CCCGGGGCCCCTAAGCCATTCCTGAAGTCATGGGCTGGCCAGGACATTGGTGACCCGCCAAT 

CCGGTATGGACGACTGGAAGCCCAGCCCCCTCATCAAGCCCTTTGGGGCTCGGAAGAAGCGG 

AGCTGGTACCTTACCTGGAAGTATAAACTGACAAACCAGCGGGCCCTGCGGAGATTCTGTCA 

GACAGGGGCCGTGCTTTTCCTGCTGGTGACTGTCATTGTCAATATCAAGTTGATCCTGGACA 

CTCGGCGAGCCATCAGTGAAGCCAATGAAGACCCAGAGCCAGAGCAAGACTATGATGAGGCC 

CTAGGCCGCCTGGAGCCCCCACGGCGCAGAGGCAGTGGTCCCCGGCGGGTCCTGGACGTAGA 

GGTGTATTCAAGTCGCAGCAAAGTATATGTGGCAGTGGATGGCACCACGGTGCTGGAGGATG 

AGGCCCGGGAGCAGGGCCGGGGCATCCATGTCATTGTCCTCAACCAGGCCACGGGCCACGTG 

ATGGCAAAACGTGTGTTTGACACGTACTCACCTCATGAGGATGAGGCCATGGTGCTATTCCT 

CAACATGGTAGCGCCCGGCCGAGTGCTCATCTGCACTGTCAAGGATGAGGGCTCCTTCCACC 

TCAAGGACACAGCCAAGGCTCTGCTGAGGAGCCTGGGCAGCCAGGCTGGCCCTGCCCTGGGC 

TGGAGGGACACATGGG CCTT CGTGGGACGAAAAGGAGGTCCTGTCTTCGGGGAGAAAC ATTC 

TAAGTCACCTGCCCTCTCTTCCTGGGGGGACCCAGTCCTGCTGAAGACAGATGTGCCATTGA 

GCTCAGCAGAAGAGGCAGAGTGCCACTGGGCAGACACAGAGCTGAACCGTCGCCGCCGGCGC 

TTCTGCAGCAAAGTTGAGGGCTATGGAAGTGTATGCAGCTGCAAGGACCCCACACCCATCGA 

GTTCAGCCCTGACCCACTCCCAGACAACAAGGTCCTCAATGTGCCTGTGGCTGTCATTGCAG 

GGAACCGACCCAATTACCTGTACAGGATGCTGCGCTCTCTGCTTTCAGCCCAGGGGGTGTCT 

CCTCAGATGATAACAGTTTTCATTGACGGCTACTATGAGGAACCCATGGATGTGGTGGCACT 

GTTTGGTCTGAGGGGCATCCAGCATACTCCCATCAGCATCAAGAATGCCCGCGTGTCTCAGC 

ACTACAAGGCCAGCCTCACTGCCACTTTCAACCTGTTTCCGGAGGCCAAGTTTGCTGTGGTT 

CTGGAAGAGGACCTGGACATTGCTGTGGATTTTTTCAGTTTCCTGAGCCAATCCATCCACCT 

ACTGGAGGAGGATGACAGCCTGTACTGCATCTCTGCCTGGAATGACCAGGGGTATGAACACA 

CGGCTGAGGACCCAGCACTACTGTACCGTGTGGAGACCATGCCTGGGCTGGGCTGGGTGCTC 

AGGAGGTCCTTGTACAAGGAGGAGCTTGAGCCCAAGTGGCCTACACCGGAAAAGCTCTGGGA 

TTGGGACATGTGGATGCGGATGCCTGAACAACGCCGGGGCCGAGAGTGCATCATCCCTGACG 

TTTCCCGATCCTACCACTTTGGCATCGTCGGCCTCAACATGAATGGCTACTTTCACGAGGCC 

TACTTCAAGAAGCACAAGTTCAACACGGTTCCAGGTGTCCAGCTCAGGAATGTGGACAGTCT 

GAAGAAAGAAGCTTATGAAGTGGAAGTTCACAGGCTGCTCAGTGAGGCTGAGGTTCTGGACC 

ACAGCAAGAACCCTTGTGAAGACTCTTTCCTGCCAGACACAGAGGGCCACACCTACGTGGCC 

TTTATTCGAATGGAGAAAGATGATGACTTCACCACCTGGACCCAGCTTGCCAAGTGCCTCCA 

TATCTGGGACCTGGATGTGCGTGGCAACCATCGGGGCCTGTGGAGATTGTTTCGGAAGAAGA 

ACCACTTCCTGGTGGTGGGGGTCCCGGCTTCCCCCTACTCAGTGAAGAAGCCACCCTCAGTC 

ACCCCAATTTTCCTGGAGCCACCCCCAAAGGAGGAGGGAGCCCCAGGAGCCCCAGAACAGAC 

ATGAGACCTCCTCCAGGACCCTGCGGGGCTGGGTACTGTGTACCCCCAGGCTGGCTAGCCCT 

TCCCTCCATCCTGTAGGATTTTGTAGATGCTGGTAGGGGCTGGGGCTACCTTGTTTTTAACA 

TGAGACTTAATTACTAACTCCAAGGGGAGGGTTCCCCTGCTCCAACACCCCGTTCCTGAGTT 

AAAAGTCTATTTATTTACTTCCTTGTTGGAGAAGGGCAGGAGAGTACCTGGGAATCATTACG 

ATCCCTAGCAGCTCATCCTGCCCTTTGAATACCCTCACTTTCCAGGCCTGGCTCAGAATCTA 

ACCTATTTATTGACTGTCCTGAGGGCCTTGAAAACAGGCCGAACCTGGAGGGCCTGGATTTC 

TTTTTGGGCTGGAATGCTGCCCTGAGGGTGGGGCTGGCTCTTACTCAGGAAACTGCTGTG.ee 

CAACCCATGGACAGGCCCAGCTGGGGCCCACATGCTGACACAGACTCACTCAGAGACCCTTA 

GACACTGGACCAGGCCTCCTCTCAGCCTTCTCTTTGTCCAGATTTCCAAAGCTGGATAAGTT 

GGTCATTGATTAAAAAAGGAGAAGCCCTCTGGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FTGTJRE 54 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA61185 
xsubunit 1 of 1, 660 aa, 1 stop 
><MW: 75220, pi: 6.76, NX{S/T): 0 

MDDWKPSPLIKPFGARKKRSWYLTWKYKLTNQRALRRFCQTGAVLFLLVTVIVNIKLILDTR 
RAISEANEDPEPEQDYDEALGRLEPPRRRGSGPRRVLDVEVYSSRSKVYVAVDGTTVLEDEA 
REQGRGIHVIVLNQATGHVMAKRVFDTYSPHEDEAMVLFLNMVAPGRVLICTVKDEGSFHLK 
DTAKALLRSLGSQAGPALGWRDTWAFVGRKGGPVFGEKHSKSPALSSWGDPVLLKTDVPLSS 
AEEAECHWADTELNRRRRRFCSKVEGYGSVCSCKDPTPIEFSPDPLPDNKVLNVPVAVIAGN 
RPNYLYRMLRSLLSAQGVS PQMITVF I DGYYEE PMDWALFGLRG I QHT P I S I KNARVSQHY 
KASLTATFNLFPEAKFAWLEEDLDIAVDFFSFLSQSIHLLEEDDSLYCISAWNDQGYEHTA 
EDPALLYRVETMPGLGWVLRRSLYKEELEPKWPTPEKLWDWDMWMRMPEQRRGRECI I PDVS 
RSYHFGIVGLNMNGYFHEAYFKKHKFNTVPGVQLRNVDSLKKEAYEVEVHRLLSEAEVLDHS 
KNPCEDS FLPDTEGHTYVAF I RME KDDDFTTWTQtiAKCLH I WDLDVRGNHRGLWRLFRKKNH 
FL WGVP AS P YS VKKP PS VT P I FLE P P P KEEGAPGAPEQT 

Important features of the protein: 
Transmembrane domain: 

amino acids 3 8-55 

Homologous region to Mouse GNT1 

amino acids 229-660 
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FTGIIRE 55 

CGGACGCGTGGGCTGCTGGTGGGAAGGCCTAAAGAACTGGAAAGCCCACTCTCTTGGAACCACCACAC 
CTGTTTAAAGAACCTAAGCACCATTTAAAGCCACTGGAAATTTGTTGTCTAGTGGTTGTGGGTGAATA 

AAGGAGGG CAGAATGG ATGATTT CATCTCCATT AG CC TG CTGTCTCTGG CTATGTTGGTGGGATGTTA 

CGTGGCCGGAATCATTCCCTTGGCTGTTAATTTCTCAGAGGAACGACTGAAGCTGGTGACTGTTTTGG 

GTGCTGGCCTTCTCTGTGGAACTGCTCTGGCAGTCATCGTGCCTGAAGGAGTACATGCCCTTTATGAA 

GATATTCTTG AGGGAAAACAC CAC CAAGCAAGTGAAACACATAATGTGATTGCAT CAGACAAAGCAGC 

AGAAAAATCAGTTGTCCATGAACATGAGCACAGCCACGACCACACACAGCTGCATGCCTATATTGGTG 

TTTCCCTCGTTCTGGGCTTCGTTTTCATGTTGCTGGTGGACCAGATTGGTAACTCCCATGTGCATTCT 

ACTGACGATCGAGAAGCAGCAAGGTCTAGCAATTCCAAAATCACCACCACGCTGGGTCTGGTTGTCCA 

TG CTGCAG CTG ATGGTGTTGCTTTGGGAGCAGCAG CATCTACTTCAC AGAC C AGTGTCC AGTT AATTG 

TGTTTGTGGCAATCATGCTAC^TAAGGCACCAGCTGCTTTTGGACTGGTTTCCTTCTTGATGCATGCT 

GGCTTAGAGCGG AAT CGAATCAG AAAGCACTTGCTGGTCTTTGCATTGGCAGCAC CAGTT ATGT C CAT 

GGTGACATACTTAGGACTGAGTAAGAGCAGTAAAGAAGCCCTTTCAGAGGTGAACGCCACGGGAGTGG 

CCATGCTTTTCTCTGCCGGGACATTTCTTTATGTTGCCACAGTACATGTCCTCCCTGAGGTGGGCGGA 

AT AGGG CACAGCCAC AAGCCCG ATGCCACGGGAGGGAGAGG CCTCAGCCGCCTGGAAGTGGCAG C CCT 

GGTTCTGGGTTGCCTCATCCCTCTCATCCTGTCAGTAGGACACCAGCATTAAATGTTCAAGGTCCAGC 

CTTGGTCCAGGGCCGTTTGCCATCCAGTGAGAACAGCCGGCACGTGACAGCTACTCACTTCCTCAGTC 

TC TTGT CT CAC CTTG CGCATCT CT AGATGTATTCCT AGAGTCC AGAGGGGAGGTGAGGTTAAAACCTG 

AGTAATGGAAAAGCTTTTAGAGTAGAAACACATTTACGTTGCAGTTAGCTATAGACATCCCATTGTGT 

TATCTTTTAAAAGG CC CTTG AC ATTTTG CGTTTTAATATTTCT CTTAACCCT ATTCTCAGGGAAGATG 

GAATTTAGTTTTAAGGAAAAGAGGAGAACTTCATACTCACAATGAAATAGTGATTATGAAAATACAGT 

GTT CTGTAATTAAGCTATGT CT CTTT CTTCTT AGTT TAGAGG CTCTGCTACTTTAT CCATTGATTTTT 

AACATGGTTCCCACC^TGTAAGACTGGTGCTTTAGCATCTATGCCACATGCGTTGATGGAAGGTCATA 

GCAC C CACTCACTTAGATGCTAAAGGTGATTCT AGTT AAT CTGGG ATT AGGGTCAGGAAAATGATAGC 

AAGACACATTGAAAGC TC TCTTT ATACT C AAAAGAG ATATCCATTGAAAAGGGATGTCT AGAGGGATT 

T AAACAGCT CCTTTGGCACGTGCCTCTCTGAATCCAGCCTGC CATTCCATCAAATGG AG C AGGAGAGG 

TGGGAGGAGCTTCTAAAGAGGTGACTGGTATTTTGT AG CATT CCTTGTCAAGTTCTCCTTTGCAGAAT 

ACCTGTCTCCACATTCCTAGAGAGGAGCCAAGTTCTAGTAGTTTCAGTTCTAGGCTTTCCTTCAAGAA 

CAGTCAGATCACAAAGTGTCTTTGGAAATTAAGGGATATTAAATTTTAAGTGATTTTTGGATGGTTAT 

TGATATCTTTGTAGTAGCTTTTTTTAAAAGACTACCAAAATGTATGGTTGTCCTTTTTTTTTGTTTTT 

TTTTTT TTTAATTATTT CTCTTAGCAGATCAGCAATCC CTCT AGGGACCTAAATACTAGGTC AG CTTT 

GGCGACACTGTGTCTTCTCACATAACCACCTGTAGCAAGATGGATCATAAATGAGAAGTGTTTGCCTA 

TTGATTTAAAGCTTATTGGAATCATGTCTCTTGTCTCTTCGTCITTTCTTTGCTTTTCTTCTA^ 

TCCCTCTAGCCTCTCCTCGCCACAATTTGCTGCTTACTGCTGGTGTTAATATTTGTGTGGGATGAATT 

CTTATCAGGACAACCACTTCTCGAACTGTAATAATGAAGATAATAATATCTTTATTCTTTATCCCCTT 

CAAAGAAATTACCTTTGTGTCAAATGCCGCTTTGTTGAGCCCTTAAAATACC^ 

ATTGACACAATCACTAATCTGGTAATTTAAACAATTGAGATAGCAAAAGTGTTTAACAGACTAGGATA 
AT TTTT T TT TCATATTTGCCAAAATTTTTGTAAACCCTGTCTTGTCAAATAAGTGTATAATATTGTAT 
TATTAATTTATTTTTACTTTCTATACCATTTC7LAAACACATTACACTAAGGGGGAACCAAGACTAGTT 
TCTTCAGGGCAGTGGACGTAGTAGTTTGTAAAAACGTTTTCTATGACGCA^ 

ATTTATTTCCTTCATGAATTTGTCACTGGATCAGCAGCTGTGGAAATAAAGCTTGTGAGCCCTCTGCT 
GGCCACAGTGAGGAAAGTAGCAC^AATAGGATACAGTTGTATGTAGTCATTGGCAACAATTGCATACA 
ATTTTACTACCAAGAGAAGGTATAGTATGGAAAGTCCAAATGACTTCCTTGATTGGATGTTAACAGCT 
GACTGGTGTGAGACTTGAGGTTTC^TCTAGTCCTTCAAAACTATATGGTTGCCTAGATTCTCTCTGGA 
AACTGACTTTGTCAAATAAATAGCAGATTGTAGTGTCAAAAAAAA 
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FIGURE 56 

MDDF I S I SLLSLAMLVGCYVAG I I PLAVNF S E ERLKLVT VLG AGLLCGTALAV I VPEG VHAL 
YEDILEGKHHQASETHNVIASDKAAEKSWHEHEHSHDHTQLHAYIGVSLVLGFVFMLLVDQ 
IGNSHVHSTDDPEAARSSNSKITTTLGLWHAAADGVALGAAASTSQTSVQLIVFVAIMLHK 
AP AAFGLVS F LMHAGLERNR I RKHLL VF AL AAP VMSMVT YLGL S KS S KEAL S E VNATGVAML 
FS AGTFL YVATVHVLPEVGGI GHSHKPDATGGRGLS RLE VAALVLGCL I PL I LS VGHQH 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 37-56, 106-122, 211-230, 240-260, 288-304 
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FIGURE 57 



GCTCGAGGCCGGCGGCGGCGGGAGAGCGACCCGGGCGGCCTCGTAGCGGGGCCCCGGATCCC 

CGAGTGGCGGCCGGAGCCTCGAAAAGAGATTCTCAGCGCTGATTTTGAGATGATGGGCTTGG 

GAAACGGGCGTCGCAGCATGAAGTCGCCGCCCCTCGTGCTGGCCGCCCTGGTGGCCTGCATC 

ATCGTCTTGGGCTTCAACTACTGGATTGCGAGCTCCCGGAGCGTGGACCTCCAGACACGGAT 

CATGGAGCTGGAAGGCAGGGTCCGCAGGGCGGCTGCAGAGAGAGGCGCCGTGGAGCTGAAGA 

AGAACGAGTTCCAGGGAGAGCTGGAGAAGCAGCGGGAGCAGCTTGACAAAATCCAGTCCAGC 

CACAACTTCCAGCTGGAGAGCGTCAACAAGCTGTACCAGGACGAAAAGGCGGTTTTGGTGAA 

TAACATCACCACAGGTGAGAGGCTCATCCGAGTGCTGCAAGACCAGTTAAAGACCCTGCAGA 

GGAATTACGGCAGGCTGCAGCAGGATGTCCTCCAGTTTCAGAAGAACCAGACCAACCTGGAG 

AGGAAGTTCTCCTACGACCTGAGCCAGTGCATCAATCAGATGAAGGAGGTGAAGGAACAGTG 

TGAGGAGCGAATAGAAGAGGTCACCAAAAAGGGGAATGAAGCTGTAGCTTCCAGAGACCTGA 

GTGAAAACAACGACCAGAGACAGCAGCTCCAAGCCCTCAGTGAGCCTCAGCCCAGGCTGCAG 

GCAGCAGGCCTGCCACACACAGAGGTGCCACAAGGGAAGGGAAACGTGCTTGGTAACAGCAA 

GTCCCAGACACCAGCCCCCAGTTCCGAAGTGGTTTTGGATTCAAAGAGACAAGTTGAGAAAG 

AGGAAACCAATGAGATCCAGGTGGTGAATG AGGAGC CTCAGAGGGACAGG CTGC CGCAGGAG 

CCAGGCCGGGAGCAGGTGGTGGAAGACAGACCTGTAGGTGGAAGAGGCTTCGGGGGAGCCGG 

AGAACTGGGCCAGACCCCACAGGTGCAGGCTGCCCTGTCAGTGAGCCAGGAAAATCCAGAGA 

TGGAGGGC CCTGAGCGAGAC CAGCTTGTCATCCCCGACGGACAGGAGGAGGAGCAGGAAGCT 

GCCGGGGAAGGGAGAAACCAGCAGAAACTGAGAGGAGAAGATGACTACAACATGGATGAAAA 

TGAAGCAGAATCTGAGACAGACAAGCAAGCAGCCCTGGCAGGGAATGACAGAAACATAGATG 

TTTTTAATGTTGAAGATCAGAAAAGAGACACCATAAATTTACTTGATCAGCGTGAAAAGCGG 

AATCATACACTCTGAATTGAACTGGAATCACATATTTCACAACAGGGCCGAAGAGATGACTA 

TAAAATGTTCATGAGGGACTGAATACTGAAAACTGTGAAATGT 
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FIGURE 58 

MMGLGNGRRSMKSPPLVLAALVACIIVLGFNYWIASSRSVDLQTRIMELEGRVRRAAAERGA 
VELKKNEPQGELEKQREQLDKIQSSHNFQLESVNKLYQDEKAVLVNNITTGERLIRVIiQDQL 
KTLQRNYGRLQQDVLQFQKNQTNLERKFSYDLSQCINQMKEVKEQCEERIEEVTKKGNEAVA 
SRDLSENNDQRQQLQALSEPQPRLQAAGLPHTEVPQGKGNVLGNSKSQTPAPSSEWLDSKR 
QVEKEETNEIQWNEEPQRDRLPQEPGREQWEDRPVGGRGFGGAGELGQTPQVQAALSVSQ 
ENPEMEGPERDQLVIPDGQEEEQEAAGEGRNQQKLRGEDDYNMDENEAESETDKQAALAGND 
RNIDVFNVEDQKRDTINLLDQREKRNHTL 

Signal peptide: 

amino acids 1-29 
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FIGURE 59 

GG ATG CAGAAAGCCTCAGTGTTGCTCTTCCTGGCCTGGGTCTGCTTCCTCTTCTACGCTGGCATTGCCCTCTTCA 
CCA.GTGGCTTCCTGCTCACCCGTTTGGAGCTCACCAACCATAGCA.GCTGCCAAGAGCCCCCAGGCCCTGGGTCCC 
TGCCATGGGGGAGCCAAGGGAAACCTGGGG CC TG CTGGATGGCTTCCCG ATTTTCGCGGGTTG TGTTGGTG CTGA 
TAGATGCTCTGCGATTTGACTTCGCCC^GCCCCAGCATTCACACGTGCCTAGAGAGCCTCCTGTCTCCCTACCCT 
TCCTGGGCAAACTAAGCTCCTTGCAG AGGATC CTGGAGATTCAGCC CCACC ATG CCCGGCTC TACCGATCTCAGG 
TTGACCCTCCTACCACCACCATGCAGCGCCTCAAGGCCCTCACCACTGGCTCACTGCCrACCTTTATTGATGCTG 
GTAGTAACTTCGCCAGCCACGCCATAGTGGAAGACAATCTCATTAAGCAGCTCACCAGTGCAGGAAGGCGTGTAG 
TCTTCATGGGAGATGATACCTGGAAAGACCTTTTCCCTGGTGCTTTCTCCAAAGCTTTCTTCTTCCCATCCTTCA 
ATGTCAGAGACCTAGACACAGTGGACAATGGCATCCTGGAACACCTCTACCCCACCA 

ACGTGCTGATTGCTCACTTCCTGGGTGTGGACCACTGTGGCCACAAGCATGGCCCTCACCACCCTGAAATGGCCA 

AGAAACTTAGCCAGATGGACCAGGTGATCCAGGGACTTGTGGAGCGTCTGGAGAATGACACACTGCTGGTAGTGG 

CTGGGGACCATGGGATGACCACAAATGGAGACCATGGAGGGGACAGTGAGCTGGAGGTCTCAGCTGCTCTCTTTC 

TGTATAGCCCCACAGCAGTCTTCCCCAGCACCCCACCAGAGGAGCCAGAGGTGATTCCTCAAGTTAGCCTTGTGC 

CCACGCTGGCCCTGCTGCTGGGCCTGCCCATCCCATTTGGGAATATCGGGGAAGTGATGGCTGAGCTATTCTCAG 

GGGGTGAGGACTCCCAGCCCCACTCCTCTGCTTTAGCCCAAGCCTCAGCTCTCCATCTCAATGCTCAGCAGGTGT 

CCCGATTTCTTCATACCTACTCAGCTGCTACTCAGGACCTTCAAGCTAAGGAGCTTCATCAGCTGCAGAACCTCT 

TCTCCAAGGCCTCTGCTGACTACCAGTGGCTTCTCCAGAGCCCCAAGGGGGCTGAGGCGACACTGCCGACTGTGA 

TTGCTGAGCTGCAGCAGTTCCTGCGGGGAGCTCGGGCCATGTGCATCGAGTCTTGGGCTCGTTTCTCTCTGGTCC 

GCATGGCGGGGGGTACTGCTCTCTTGGCTGCTTCCTGCTTTATCTGCCTGCTGGCATCTCAGTGGGCAATATCCC 

CAGGCTTTCCATTCTGCCCTCTACTCCTGACACCTGTGGCCTGGGGCCTGGTTGGGGCCATAGCGTATGCTGGAC 

TCCTGGGAACTATTGAGCTGAAGCTAGATCTAGTGCTTCTAGGGGCTGTGGCTGCAGTGAGCTCATTCCTCCCTT 

TTCTGTGGAAAGCCTGGGCTGGCTGGGGGTCCAAGAGGCCCCTGGCAACCCTGTTTCCCATCCCTGGGCCCGTCC 

TGTTACTCCTGCTGTTTCGCTTGGCTGTGTTCTTCTCTGATAGTTTTGTTGTAGCTGAGGCCAGGGCCACCCCCT 

TCCTTTTGGGCTCATTCATCCTGCTCCTGGTTGTCCAGCTTCACTGGGAGGGCCAGCTGCTTCCACCTAAGCTAC 

TCACAATGCCCCGCCTTGGCACTTCAGCCACMCAAACCCCCCACGGCACAATGGTGCATATGCCCTGAGGCTTG 

GAATTGGGTTG CTTTTATGTACAAGG CTAG CTGGGCTTTTTCATCGTTGC CCTG AAG AG ACACCTGTTTGCCACT 

CCTCTCCCTGGCTGAGTCCTCTGGCATCCATGGTGGGTGGTCGAGCCAAGAATTTATGGTATGGAGCTTGTGTGG 

CGGCGCTGGTGGCCCTGTTAGCTGCCGTGCGCTTGTGGCTTCGCCGCTATGGTAATCTCAAGAGCCCCGAGCCAC 

CCATGCTCTTTGTGCGCTGGGGACTGCCCCTAATGGCATTGGGTACTGCTGCCTACTGGGCATTGGCGTCGGGGG 

CAGATGAGGCTCCCCCCCGTCTCCGGGTCCTGGTCTCTGGGGCATCCATGGTGCTGCCTCGGGCTGTAGCAGGGC 

TGGCTGCTTCAGGGCTCGCGCTGCTGCTCTGGAAGCCTGTGACAGTGCTGGTGAAGGCTGGGGCAGGCGCTCCAA 

GGACCAGGACTGTCCTCACTCCCTTCTCAGGCCCCCCCACTTCTCAAGCTGACTTGGATTATGTGGTCCCTCAAA 

TCTACCGACACATGCAGGAGGAGTTCCGGGGCCGGTTAGAGAGGACCAAATCTCAGGGTCCCCTGACTGTGGCTG 

CTTATCAGTTGGGGAGTGTCTACTCAGCTGCTATGGTCAC^GCCCTCACCCTGTTGGCCTTCCCACTTCTGCrrGT 

TGCATGCGGAGCGCATCAGCCTTGTGTTCCTGCTTCTGTTTCTGCAGAGCTTCCTTCTCCTACATCTGCTTGCT 

CTGGGATACCCGTCACCACCCCTGGTCCTTTTACTGTG CC ATGG CAGGCAGTCTCGGCTTGGGCC CTCATGGCCA 

CACAGACCTTCTACTCCACAGGCCACCAGCCTGTCTTTCCAGCCIATCCATTGGCATGCAGCCTTCGTGGGATTC 

CAGAGGGTCATGGCTCCTGTACTTGGCTGCCTGCTTTGCTAGTGGGAGCCAACACCTTTGCCTCCCACCTCCTCT 

TTGCAGTAGGTTGCCCACTGCTCCTGCTCTGGCCTTTCCTGTGTGAGAGTCAAGGGCTGCGGAAGAGACAGCAGC 

CCCCAGGGAATGAAGCTGATGCCAGAGTCAGACCCGAGGAGGAAGAGGAGCCACTGATGGAGATGCGGCTCCGGG 

ATGCGCCTC^GCACTTCTATGCAGCACTGCTGCAGCTGGGCCTCAAGTACCTCTTTATCCrTGGTATTCAGATrC 

TGGCCTGTGCCITGGCAGCCTCCATCCTTCGCAGGC^TCITCATGGT^ 

TTGAGGCTGTGGGCTTCATTGTGAGCAGCGTGGGACn^CTCCTGGGCATAGCTTTGGTGATGAGAGTGGATGGTC 

CTGTGAGCTCCTGGTTCAGGCAGCTATTTCTGGCCCAGCAGAG GTAG CCTAGT 

ACAGAGAGTGCTGGAGAACAGTGTAGCCTGGCCTGTACAGGTACTGGATGATCTGC^ 

TCTTACTATCATGCAGCCAGGGGCCGCTGAC^TCTAGGACTTCATTATTCrATAATTCAGGACCACAGTGGAGTA 
TGATCCCTAACTCCTGATTTGGATGCATCTGAGGGACAAGGGGGGCGGTCTCCGAAGTGGAATAAAATAGGCCGG 
GCGTG<3TGACTTGCACCTATAATCCCAGCACTTTGGGAGGC^GAGGTGGGAGGATTGCTTGGTCCC^GGAGTTC^ 
AGACCAGCCTGTGGAACATAACAAGACCCCGTCTCTACTATTTAAAAAAAAGTGTAATAAAATGATAATAT 
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FIGURE 60 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA6280 9 
<subunit 1 of 1, 1089 aa, 1 stop 
<MW: 118699, pi: 8.49, NX{S/T): 2 

MQKASVLLFLAWVCFLFYAGIALFTSGFLLTRLELTNHSSCQEPPGPGSLPWGSQGKPGACW 
MASRFSRWLVLIDALRFDFAQPQHSHVPREPPVSLPFLGKLSSLQRILEIQPHHARLYRSQ 
VDPPTTTMQRLKALTTGSLPTFIDAGSNFASHAIVEDNLIKQLTSAGRRWFMGDDTWKDIiF 
PGAFSKAFFFPSFNVRDLDTVDNGILEHLYPTMDSGEWDVLIAHFLGVDHCGHKHGPHHPEM 
AKKLSQMDQVIQGLVERLENDTLLWAGDHGMTTNGDHGGDSELEVSAALFLYSPTAVFPST 
PPEEPEVIPQVSLVPTLALLLGLPIPFGNIGEVMAELFSGGEDSQPHSSALAQASALHLNAQ 
QVSRFLHTYSAATQDLQAKELHQLQNLFSKASADYQWLLQSPKGAEATLPTVIAELQQFLRG 
ARAMC I E SWARFS LVRMAGGTALLAAS CF I CLLASQWA I S PGFPFC PLLLTP VAWGLVGAI A 
YAGLLGTIELKLDLVLLGAVAAVSSFLPFLWKAWAGWGSKRPLATLFPIPGPVLLLLLFRLA 
VFFSDSFWAEARATPFLLGSFILLLWQLHWEGQLLPPKLLTMPRLGTSATTNPPRHNGAY 
ALRLG I GL LL CTRLAGLFHRCP EETP VCHS S P WL S PLAS MVGGRAKNLW YG AC VAAL VALLA 
AVRLWLRRYGNLKSPEPPMLFVRWGLPLMALGTAAYWALASGADEAPPRLRVLVSGASMVLP 
RAVAGLAASGLALLLWKPVTVLVKAGAGAPRTRTVLTPFSGPPTSQADLDYWPQIYRHMQE 
EFRGRLERTKSQGPLTVAAYQLGSVYSAAMVTALTLLAFPLLLLHAERISLVFLLLFLQSFL 
LLHLLAAGIPVTTPGPFTVPWQAVSAWALMATQTFYSTGHQPVFPAIHWHAAFVGFPEGHGS 
CTWLPALLVGANTFASHLLFAVGCPLLLLWPFLCESQGLRKRQQPPGNEADARVRPEEEEEP 
LMEMRLRDAPQHFYAALLQLGLKYLF I LGI Q I LACALAAS I LRRHLMVWKVFAPKF I FEAVG 
F TVS S VGLLLG I ALVMRVDGAVS SVfFRQLFLAQQR 



Important features: 
Signal peptide: 

amino acids 1-16 



Transmembrane domains: 

amino acids 317-341, 451-470, 481-500, 510-527, 538-555, 831-850, 
1016-1034, 1052-1070 



Leucine zipper pattern. 

amino acids 843-864 



N-glycosylation sites. 

amino acids 37-40, 268-271 
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FTGURE 61 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGT 

GTCTCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGT 

CCTACAATGGACTCCACCAGAGGGTCTTCAAGGAGTTAAAGTTACTTACACTGTGCAGTATT 

TCATCACAAATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGT 

CCTGACAGCTCCAGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAA 

TATACTCCAATCTGAAGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCC 

CAGTGTGTGACCAACCACACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGT 

ACACGTGGAGTCCTTCGTCCCAGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTG 

CCAGGACTTTGAAAGATCAATCATCAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTG 

CCCATATCTATTACCGTGTTTCTTTTTTCTGTGATGGGCTATTCCATCTACCGATATATCCA 

CGTTGGCAAAGAGAAACACCCAGCAAATTTGATTTTGATTTATGGAAATGAATTTGACAAAA 

GATTCTTTGTGCCTGCTGAAAAAATCGTGATTAACTTTATCACCCTCAATATCTCGGATGAT 

TCTAAAATTTCTCATCAGGATATGAGTTTACTGGGAAAAAGCAGTGATGTATCCAGCCTTAA 

TGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGAGGAAGAGGAGGTGAAACATTTAG 

GGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAGAAAACACGGAAGGTACTTCT 

CTCACCCAGCAAGAGTCC CTCAGCAGAACAATACCCCCGGAT AAAACAGT CATTGAATATGA 

ATATGATGTCAGAACCACTGACATTTGTGCGGGGCCTGAAGAGCAGGAGCTCAGTTTGCAGG 

AGGAGGTGTCCACACAAGGAACATTATTGGAGTCGCAGGCAGCGTTGGCAGTCTTGGGCCCG 

CAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCGCAGGAGCA 

CACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGATCCCC 

AAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCGAG 

CCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGC 

TCCAGACAGGCCACCAGGAGAAAATGAAACCTATCTCATGCAATTCATGGAGGAATGGGGGT 

TATATGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAAC 

AAGTGAGTCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGT 

TTGTCAGTGTCTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTG 

GTTCATGCATGTAGGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGT 

TGTTCTATGCAGAGAAAGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGG 

TGGGTGT 
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FTGTJRE 62 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA62815 
<subunit 1 of 1, 442 aa, 1 stop 
<MW: 49932, pi: 4.55, NX(S/T): 5 

MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQ 
QIYSNLKY1WSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQ 
CARTLKDQ S S E P KAK 1 1 FWY VL P I S I T VFLF S VMG Y S I YR YI HVGKE KH P ANL I L I YGNE FD 
KRFFVPAEKIVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKH 
LGYASHLMEIFCDSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSL 
QEEVSTQGTLLESQAALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWD 
PQTGRLCIPSLSSFDQDSEGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEW 
GLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycoaylation sites* 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 



WO 00/12708 



PCT/US99/20U1 



FIGURE 63 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGTCTCTGCGGGGAGACGCCAGCCTGCG 
TCTGCCATGGGGCTCGGGTTGAGGGGCTGGGGACGTCCTCTGCTGACTGTGGCCACCGCCCT 
GATGCTGCCCGTGAAGCCCCCCGCAGGCTCCTGGGGGGCCCAGATCATCGGGGGCCACGAGG 
TGACCCCCCACTCCAGGCCCTACATGGCATCCGTGCGCTTCGGGGGCCAACATCACTGCGGA 
GGCTTCCTGCTGCGAGCCCGCTGGGTGGTCTCGGCCGCCCACTGCTTCAGCCACAGAGACCT 
CCGCACTGGCCTGGTGGTGCTGGGCGCCCACGTCCTGAGTACTGCGGAGCCCACCCAGCAGG 
TGTTTGGCATCGATGCTCTCACCACGCACCCCGACTACCACCCCATGACCCACGCCAACGAC 
ATCTGCCTGCTGCGGCTGAACGGCTCTGCTGTCCTGGGCCCTGCAGTGGGGCTGCTGAGGCT 
GCCAGGGAGAAGGGCCAGGCCCCCCACAGCGGGGACACGGTGCCGGGTGGCTGGCTGGGGCT 
TCGTGTCTGACTTTGAGGAGCTGCCGCCTGGACTGATGGAGGCCAAGGTCCGAGTGCTGGAC 
CCGGACGTCTGCAACAGCTCCTGGAAGGGCCACCTGACACTTACCATGCTCTGCACCCGCAG 
TGGGGACAGCCACAGACGGGGCTTCTGCTCGGCCGACTCCGGAGGGCCCCTGGTGTGCAGGA 
ACCGGGCTCACGGCCTCGTTTCCTTCTCGGGCCTCTGGTGCGGCGACCCCAAGACCCCCGAC 
GTGTACACGCAGGTGTCCGCCTTTGTGGCCTGGATCTGGGACGTGGTTCGGCGGAGCAGTCC 
CCAGCCCGGCCCCCTGCCTGGGACCACCAGGCCCCCAGGAGAAGCCGCCTGAGCCACAACCT 
TGCGGCATGCAAATGAGATGGCCGCTCCAGGCCTGGAATGTTCCGTGGCTGGGCCCCACGGG 
AAGCCTGATGTTCAGGGTTGGGGTGGGACGGGCAGCGGTGGGGCACACCCATTCCACATGCA 
AAGGGCAGAAGCAAACCCAGTAAAATGTTAACTGACAAAAAAAAAAAAAAAAAAAAGAAA 
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FIGURE 64 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA62845 
xsubunit 1 of 1, 283 aa, 1 stop 
><MW: 30350, pi: 9.66, NX(S/T): 2 

MGLGLRGWGRPLLTVATALMLPVKPPAGSWGAQIIGGHEVTPHSRPYMASVRFGGQHHCGGF 
LLRARWWSAAHCFSHRDLRTGLWLGAHVLSTAEPTQQVFGIDALTTHPDYHPMTHANDIC 
LLRLNGSAVLGPAVGLLRLPGRRARPPTAGTRCRVAGWGFVSDFEELPPGLMEAKVRVLDPD 
VCNSSWKGHLTLTMLCTRSGDSHRRGFCSADSGGPLVCRNRAHGLVSFSGLWCGDPKTPDVY 
TQVSAFVAWIWDWRRSSPQPGPLPGTTRPPGEAA 

Signal peptide: 

amino acids 1-30 
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FIGURE 65 

GAGCTACCCAGGCGGCTGGTGTGCAGCAAGCTCCGCGCCGACTCCGGACGCCTGACGCCTGA 
CGCCTGTCCCCGGCCCGGCATGAGCCGCTACCTGCTGCCGCTGTCGGCGCTGGGCACGGTAG 
CAGGCGCCGCCGTGCTGCTCAAGGACTATGTCACCGGTGGGGCTTGCCCCAGCAAGGCCACC 
ATCCCTGGGAAGACGGTCATCGTGACGGGCGCCAACACAGGCATCGGGAAGCAGACCGCCTT 
GGAACTGGCCAGGAGAGGAGGCAACATCATCCTGGCCTGCCGAGACATGGAGAAGTGTGAGG 
CGGCAGCAAAGGACATCCGCGGGGAGACCCTCAATCACCATGTCAACGCCCGGCACCTGGAC 
TTGGCTTCCCTCAAGTCTATCCGAGAGTTTGCAGCAAAGATCATTGAAGAGGAGGAGCGAGT 
GGACATTCTAATCAACAACGCGGGTGTGATGCGGTGCCCCCACTGGACCACCGAGGACGGCT 
TCGAGATGCAGTTTGGCGTTAACCACCTGGGTCACTTTCTCTTGACAAACTTGCTGCTGGAC 
AAGCTGAAAGCCTCAGCCCCTTCGCGGATCATCAACCTCTCGTCCCTGGCCCATGTTGCTGG 
GCACATAGACTTTGACGACTTGAACTGGCAGACGAGGAAGTATAACACCAAAGCCGCCTACT 
GCCAGAGCAAGCTCGCCATCGTCCTCTTCACCAAGGAGCTGAGCCGGCGGCTGCAAGGCTCT 
GGTGTGACTGTCAACGCCCTGCACCCCGGCGTGGCCAGGACAGAGCTGGGCAGACACACGGG 
CATCCATGGCTCCACCTTCTCCAGCACCACACTCGGGCCCATCTTCTGGCTGCTGGTCAAGA 
GCCCCGAGCTGGCCGCCCAGCCCAGCACATACCTGGCCGTGGCGGAGGAACTGGCGGATGTT 
TCCGGAAAGTACTTCGATGGACTCAAACAGAAGGCCCCGGCCCCCGAGGCTGAGGATGAGGA 
GGTGGCCCGGAGGCTTTGGGCTGAAAGTGCCCGCCTGGTGGGCTTAGAGGCTCCCTCTGTGA 
GGGAGCAGCCCCTCCCCAGATAACCTCTGGAGCAGATTTGAAAGCCAGGATGGCGCCTCCAG 
ACCGAGGACAGCTGTCCGCCATGCCCGCAGCTTCCTGGCACTACCTGAGCCGGGAGACCCAG 
GACTGGCGGCCGCCATGCCCGCAGTAGGTTCTAGGGGGCGGTGCTGGCCGCAGTGGACTGGC 
CTGCAGGTGAGCACTGCCCCGGGCTCTGGCTGGTTCCGTCTGCTCTGCTGCCAGCAGGGGAG 
AGGGGCCATCTGATGCTTCCCCTGGGAATCTAAACTGGGAATGGCCGAGGAGGAAGGGGCTC 
TGTGCACTTGCAGGCCACGTCAGGAGAGCCAGCGGTGCCTGTCGGGGAGGGTTCCAAGGTGC 
TCCGTGAAGAGCATGGGCAAGTTGTCTGACACTTGGTGGATTCTTGGGTCCCTGTGGGACCT 
TGTGCATGCATGGTCCTCTCTGAGCCTTGGTTTCTTCAGCAGTGAGATGCTCAGAATAACTG 
CTGTCTCCCATGATGGTGTGGTACAGCGAGCTGTTGTCTGGCTATGGCATGGCTGTGCCGGG 
GGTGTTTGCTGAGGGCTTCCTGTGCCAGAGCCCAGCCAGAGAGCAGGTGCAGGTGTCATCCC 
GAGTTCAGGCTCTGCACGGCATGGAGTGGGAACCCCACCAGCTGCTGCTACAGGACCTGGGA 
TTGCCTGGGACTCCCACCTTCCTATCAATTCTCATGGTAGTCCAAACTGCAGACTCTCAAAC 
TTGCTCATTT 
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FIGURE 66 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA64 842 
xsubunit 1 of 1, 331 aa, 1 stop 
><MW: 35932, pi: 8.45, NX(S/T): 1 

MSRYLLPLSALGTVAGAAVLLKDYVTGGACPSKATIPGKTVIVTGANTGIGKQTALELARRG 
GNI ILACRDMEKCEAAAKDIRGETLNHHVNARHLDLASLKSIREFAAKI IEEEERVDILINN 
AGVMRCPHWTTEDGFEMQFGWHLGHFLLTNLLLDKLKASAPSRIINLSSLAHVAGHIDFDD 
LNWQTRKYNTKAAYCQSKIjAIVLFTKELSRRLQGSGVTVNALHPGVARTELGRHTGIHGSTF 
SSTTLGPIFWLLVKSPELAAQPSTYLAVAEELADVSGKYFDGLKQKAPAPEAEDEEVARRLW 
AESARLVGLEAPSVREQPLPR 

Signal peptide: 

amino acids 1-17 



(.6/2-70 
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FIGURE 67 

GAAGTTCGCGAGCGCTGGCATGTGGTCCTGGGGCGCGGCTGGCGGCGCTGCTGGCGGTGCTG 
GCGCTCGGGACAGGAGACCCAGAAAGGGCTGCGGCTCGGGGCGACACGTTCTCGGCGCTGAC 
CAGCGTGGCGCGCGCCCTGGCGCCCGAGCGCCGGCTGCTGGGGCTGCTGAGGCGGTACCTGC 
GCGGGGAGGAGGCGCGGCTGCGGGACCTGACTAGATTCTACGACAAGGTACTTTCTTTGCAT 
GAGGATTCAACAACCCCTGTGGCTAACCCTCTGCTTGCATTTACTCTCATCAAACGCCTGCA 
GTCTGACTGGAGGAATGTGGTACATAGTCTGGAGGCCAGTGAGAACATCCGAGCTCTGAAGG 
ATGGCTATGAGAAGGTGGAGCAAGACCTTCCAGCCTTTGAGGACCTTGAGGGAGCAGCAAGG 
GCCCTGATGCGGCTGCAGGACGTGTACATGCTCAATGTGAAAGGCCTGGCCCGAGGTGTCTT 
TCAGAGAGTCACTGGCTCTGCCATCACTGACCTGTACAGCCCCAAACGGCTCTTTTCTCTCA 
CAGGGGATGACTGCTTCCAAGTTGGCAAGGTGGCCTATGACATGGGGGATTATTACCATGCC 
ATTCCATGGCTGGAGGAGGCTGTCAGTCTCTTCCGAGGATCTTACGGAGAGTGGAAGACAGA 
GGATGAGGCAAGTCTAGAAGATGCCTTGGATCACTTGGCCTTTGCTTATTTCCGGGCAGGAA 
ATGTTTCGTGTGCCCTCAGCCTCTCTCGGGAGTTTCTTCTCTACAGCCCAGATAATAAGAGG 
ATGGCCAGGAATGTCTTGAAATATGAAAGGCTCTTGGCAGAGAGCCCCAACCACGTGGTAGC 
TGAGGCTGTCATCCAGAGGCCCAATATACCCCACCTGCAGACCAGAGACACCTACGAGGGGC 
TATGTCAGACCCTGGGTTCCCAGCCCACTCTCTACCAGATCCCTAGCCTCTACTGTTCCTAT 
GAGACCAATTCCAACGCCTACCTGCTGCTCCAGCCCATCCGGAAGGAGGTCATCCACCTGGA 
GCCCTACATTGCTCTCTACCATGACTTCGTCAGTGACTCAGAGGCTCAGAAAATTAGAGAAC 
TTGCAGAACCATGGCTACAGAGGTCAGTGGTGGCATCAGGGGAGAAGCAGTTACAAGTGGAG 
TACCGCATCAGCAAAAGTGCCTGGCTGAAGGACACTGTTGACCCAAAACTGGTGACCCTCAA 
CCACCGCATTGCTGCCCTCACAGGCCTTGATGTCCGGCCTCCCTATGCAGAGTATCTGCAGG 
TGGTGAACTATGGCATCGGAGGACACTATGAGCCTCACTTTGACCATGCTACGTCACCAAGC 
AGCCCCCTCTACAGAATGAAGTCAGGAAACCGAGTTGCAACATTTATGATCTATCTGAGCTC 
GGTGGAAGCTGGAGGAGC CACAGCCTTCATCTATGC CAAC CT CAGCGTGCCTGTGGTTAGGA 
ATGCAGCACTGTTTTGGTGGAACCTGCACAGGAGTGGTGAAGGGGACAGTGACACACTTCAT 
GCTGGCTGTCCTGTCCTGGTGGGAGATAAGTGGGTGGCCAACAAGTGGATACATGAGTATGG 
ACAGGAATTC CGCAGACC CTGCAGCTCCAG CC CTGAAGACTGAACTGTTGGCAGAGAGAAGC 
TGGTGGAGTCCTGTGGCTTTCCAGAGAAGCCAGGAGCCAAAAGCTGGGGTAGGAGAGGAGAA 
AGCAGAGCAGCCTCCTGGAAGAAGGCCTTGTCAGCTTTGTCTGTGCCTCGCAAATCAGAGGC 
AAGGGAGAGGTTGTTACCAGGGGACACTGAGAATGTACATTTGATCTGCCCCAGCCACGGAA 
GTCAGAGTAGGATGCACAGTACAAAGGAGGGGGGAGTGGAGGCCTGAGAGGGAAGTTTCTGG 
AGTTCAGATACTCTCTGTTGGGAACAGGACATCTCAACAGTCTCAGGTTCGATCAGTGGGTC 
TTTTGGCACTTTGAAC CTTGAC CACAGGGACCAAGAAGTGGCAATGAGGACACCTGCAGGAG 
GGGCTAGCCTGACTCCCAGAACTTTAAGACTTTCTCCCCACTGCCTTCTGCTGCAGCCCAAG 
CAGGGAGTGTCCCCCTCCCAGAAGCATATCCCAGATGAGTGGTACATTATATAAGGATTTTT 
TTTAAGTTGAAAACAACTTTCTTTTCTTTTTGTATGATGGTTTTTTAACACAGTCATTAAAA 
ATGTTTATAAATCAAAA 
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FIGURE 68 

MGPGARLAALLAVLALGTGDPERAAARGDTFSALTSVARALAPERRLLGLLRRYLRGEEARL 
RDLTRFYDKVLSLHEDSTTPVANPLLAFTLIKRLQSDWRNWHSLEASENIRALKDGYEKVE 
QDLPAFEDLEGAARALMRLQDVYMLNVKGLARGVFQRVTGSAI TDLYS PKRLFSLTGDDCFQ 
VGKVAYDMGDYYHAIPWLEEAVSLFRGSYGEWKTEDEASLEDALDHLAFAYFRAGNVSCALS 
LSREFLLYSPDNKRMARNVLKYERLLAESPNHWAEAVIQRPNIPHLQTRDTYEGLCQTLGS 
QPTLYQIPSLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQ 
RSWASGEKQLQVEYRISKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPYAEYLQWNYGIG 
GHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPWRNAALFWW 
NLHRS GE GD S DT LHAGC P VL VGDKWVANKW I HE YGQEFRRP C S S S PED 

Signal peptide: 

amino acids 1-19 



WO 00/12708 



PCIYUS99/20111 



FIGURE 69 

GAGATAGGGAGTCTGGGTTTAAGTTCCTGCTCCATCTCAGGAGCCCCTGCTCCCACCCCTAG 
GAAGCCACCAGACTCCACGGTGTGGGGCCAATCAGGTGGAATCGGCCCTGGCAGGTGGGGCC 
ACGAGCGCTGGCTGAGGGACCGAGCCGGAGAGCCCCGGAGCCCCCGTAACCCGCGCGGGGAG 
CGCCCAGGATGCCGCGCGGGGACTCGGAGCAGGTGCGCTACTGCGCGCGCTTCTCCTACCTC 
TGGCTCAAGTTTTCACTTATCATCTATTCCACCGTGTTCTGGCTGATTGGGGCCCTGGTCCT 
GTCTGTGGGCATCTATGCAGAGGTTGAGCGGCAGAAATATAAAACCCTTGAAAGTGCCTTCC 
TGGCTCCAGCCATCATCCTCATCCTCCTGGGCGTCGTCATGTTCATGGTCTCCTTCATTGGT 
GTGCTGGCGTCCCTCCGTGACAACCTGTACCTTCTCCAAGCATTCATGTACATCCTTGGGAT 
CTGCCTCATCATGGAGCTCATTGGTGGCGTGGTGGCCTTGACCTTCCGGAACCAGACCATTG 
ACTTCCTGAACGACAACATTCGAAGAGGAATTGAGAACTACTATGATGATCTGGACTTCAAA 
AACATCATGGACTTTGTTCAGAAAAAGTTCAAGTGCTGTGGCGGGGAGGACTACCGAGATTG 
GAGCAAGAATCAGTACCAGGACTGCAGTGCCCCTGGACCCCTGGCCTGTGGGGTGCCCTACA 
CCTGCTGCATCAGGAACACGACAGAAGTTGTCAACACCATGTGTGGCTACAAAACTATCGAC 
AAGGAGCGTTTCAGTGTGCAGGATGTCATCTACGTGCGGGGCTGCACCAACGCCGTGATCAT 
CTGGTTCATGGACAACTACACCATCATGGCGTGCATCCTCCTGGGCATCCTGCTTCCCCAGT 
TCCTGGGGGTGCTGCTGACGCTGCTGTACATCACCCGGGTGGAGGACATCATCATGGAGCAC 
TCTGTCACTGATGGGCTCCTGGGGCCCGGTGCCAAGCCCAGCGTGGAGGCGGCAGGCACGGG 
ATGCTGCTTGTGCTACCCCAATTAGGGCCCAGCCTGCCATGGCAGCTCCAACAAGGACCGTC 
TGGGATAGCACCTCTCAGTCAACATCGTGGGGCTGGACAGGGCTGCGGCCCCTCTGCCCACA 
CTCAGTACTGACCAAAGCCAGGGCTGTGTGTGCCTGTGTGTAGGTCCCACGGCCTCTGCCTC 
CCCAGGGAGCAGAGCCTGGGCCTCCCCTAAGAGGCTTTCCCCGAGGCAGCTCTGGAATCTGT 
GCCCACCTGGGGCCTGGGGAACAAGGCCCTCCTTTCTCCAGGCCTGGGCTACAGGGGAGGGA 
GAGCCTGAGGCTCTGCTCAGGGCCCATTTCATCTCTGGCAGTGCCTTGGCGGTGGTATTCAA 
GGCAGTTTTGTAGCACCTGTAATTGGGGAGAGGGAGTGTGCCCCTCGGGGCAGGAGGGAAGG 
GCATCTGGGGAAGGGCAGGAGGGAAGAGCTGTCCATGCAGCCACGCCCATGGCCAGGTTGGC 
CTCTTCTCAGCCTCCCAGGTGCCTTGAGCCCTCTTGCAAGGGCGGCTGCTTCCTTGAGCCTA 
GTTTTTTTTTACGTGATTTTTGTAACATTCATTTTTTTGTACAGATAACAGGAGTTTCTGAC 
TAATCAAAGCTGGTATTTCCCCGCATGTCTTATTCTTGCCCTTCCCCCAACCAGTTTGTTAA 
TCAAACAATAAAAACATGTTTTGTTTTGTTTTTAAAAAAAAA 
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FTGURE 70 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA64 863 
xsubunit 1 of 1, 294 aa, 1 stop 
><MW: 33211, pi: 5.35, NX(S/T): 3 

MPRGDSEQVRYCARFSYLWLKFSLIIYSTVFWLIGALVLSVGIYAEVERQKYKTLESAFLAP 
AIILILLGVVMFWSFIGVLASLRDNLYLLQAFMYILGICLIMELIGGWALTFRNQTIDFL 
NDNIRRGIENYYDDLDFKNIMDFVQKKFKCCGGEDYRDWSKNQYHDCSAPGPLACGVPYTCC 
IRNTTEWNTMCGYKTIDKERFSVQDVIYVRGCTNAVIIWFMDNYTIMACILLGILLPQFLG 
VLLTLLYITRVEDIIMEHSVTDGLLGPGAKPSVEAAGTGCCLCYPN 

Signal peptide: 

amino acids 1-44 

Transmembrane domains : 

amino acids 22-42 # 57-85, 93-116, 230-257 



WO 00/12708 



PCT/US99/20111 



FIGURE 71 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGA 
CACCTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACC 
TTGATCCAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGA 
AAAGCTGACACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGC 
TC AGTGCC ATGCGGG AAAAG CC AG CCGGAGGC AT CC CTGTGCTGGG CAGC CTGGTGAACACC 
GTCCTGAAGCACATCATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAA 
GCCCTCGGCCAATGACCAGGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCA 
ACACGCCCCTGGTCAAGACCATCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATC 
CGCATGGACACCAGTGCAAGTGGCCCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCA 
TGGGAGCCTGCGCATCCAACTGCTGTATAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGC 
AGGTCATGAACCTCCTAGTGCCATCCCTGCCCAATCTAGTGAAAAACCAGCTGTGTCCCGTG 
ATCGAGGCTTCCTTCAATGGCATGTATGCAGACCTCCTGCAGCTGGTGAAGGTGCCCATTTC 
CCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTATCCTGCCATCAAGGGTGACACCATTC 
AGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAAAGGTGACCAAGTGGTTCAATAAC 
TCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCGTTCAGCCTCATCGTGAGTCA 
GGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGAATTCATGGTCCTGTTGG 
ACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGCTGATCAATGAAAAG 
GCTGCAGATAAG CTGGGATC TACC CAGATCGTGAAGATCCTAACTCAGGACACTCCCGAGTT 
TTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTTTCCCTCCA 
GTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGTTTTAC 
ACCAAAGGTGACCAACTTATACTCAACTTGAATAACATCAGCTCTGATCGGATCCAGCTGAT 
GAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCC 
ACTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTG 
AAGGCCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCC 
AGCCTCCTTGTGGAAACCCAGCTCTCCTGTCTCCCAG TGA AGACTTGGATGGCAGCCATCAG 
GGAAGGCTGGGTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAAT 
CAATAAACACTTGCCTGTGAAAAA 
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FTGTJRE 72 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA648 81 
xsubunit 1 of 1, 484 aa, 1 stop 
><MW: 52468, pi: 7.14, NX(S/T): 3 

MAGPWTFTLLCGLLAATL IQ ATLS PTAVL I LGPKVI KEKLTQELKDHNATS ILQQLPLLSAM 
REKPAGGI PVLGSLVNTVLKHI IWLKVITANILQLQVKPSANDQELLVKI PLDMVAGFNTPL 
VKTIVEFHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMN 
LLVPSLPNLVKNQLCPVIEASFNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYL 
GAKLLDSQGKVTKWFNNSAASLTMPTLDNIPFSLIVSQDWKAAVAAVLSPEEFMVLLDSVL 
PESAHRLKSSIGLINEKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEAL 
RPLFTLGIEASSEAQFYTKGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSIL 
LPNQNGKLRSGVPVSLVKALGFEAAESSLTKDALVLTPASLWKPSSPVSQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site* 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 73 

GAGCGAACATGGCAGCGCGTTGGCGGTTTTGGTGTGTCTCTGTGACCATGGTGGTGGCGCTG 
CTCATCGTTTGCGACGTTCCCTCAGCCTCTGCCCAAAGAAAGAAGGAGATGGTGTTATCTGA 
AAAGGTTAGTCAGCTGATGGAATGGACTAACAAAAGACCTGTAATAAGAATGAATGGAGACA 
AGTTCCGTCGCCTTGTGAAAGCCCCACCGAGAAATTACTCCGTTATCGTCATGTTCACTGCT 
CTCCAACTGCATAGACAGTGTGTCGTTTGCAAGCAAGCTGATGAAGAATTCCAGATCCTGGC 
AAACTCCTGGCGATACTCCAGTGCATTCAC CAAC AGGATATTTTTTGC CATGGTGGATTTTG 
ATGAAGGCTCTGATGTATTTCAGATGCTAAACATGAATTCAGCTCCAACTTTCATCAACTTT 
CCTGCAAAAGGGAAACCCAAACGGGGTGATACATATGAGTTACAGGTGCGGGGTTTTTCAGC 
TGAGCAGATTGCCCGGTGGATCGCCGACAGAACTGATGTCAATATTAGAGTGATTAGACCCC 
CAAATTATGCTGGTCCCCTTATGTTGGGATTGCTTTTGGCTGTTATTGGTGGACTTGTGTAT 
CTTCGAAGAAGTAATATGGAATTTCTCTTTAATAAAACTGGATGGGCTTTTGCAGCTTTGTG 
TTTTGTGCTTGCTATGACATCTGGTCAAATGTGGAACCATATAAGAGGACCACCATATGCCC 
ATAAGAATCCCCACACGGGACATGTGAATTATATCCATGGAAGCAGTCAAGCCCAGTTTGTA 
GCTGAAACACACATTGTTCTTCTGTTTAATGGTGGAGTTACCTTAGGAATGGTGCTTTTATG 
TGAAGCTGCTACCTCTGACATGGATATTGGAAAGCGAAAGATAATGTGTGTGGCTGGTATTG 
GACTTGTTGTATTATTCTTCAGTTGGATGCTCTCTATTTTTAGATCTAAATATCATGGCTAC 
CCATACAGCTTTCTGATGAGTTAAAAAGGTCCCAGAGATATATAGACACTGGAGTACTGGAA 
ATTGAAAAACGAAAATCGTGTGTGTTTGAAAAGAAGAATGCAACTTGTATATTTTGTATTAC 
CTCTTTTTTT CAAGTGATTTAAATAGTTAATCATTTAAC CAAAGAAGATGTGTAGTGCCTTA 
ACAAGCAATCCTCTGTCAAAATCTGAGGTATTTGAAAATAATTATCCTCTTAACCTTCTCTT 
CCCAGTGAACTTTATGGAACATTTAATTTAGTACAATTAAGTATATTATAAAAATTGTAAAA 
CTACTACTTTGTTTTAGTTAGAACAAAGCTCAAAACTACTTTAGTTAACTTGGTCATCTGAT 
TTTATATTGCCTTATCCAAAGATGGGGAAAGTAAGTCCTGACCAGGTGTTCCCACATATGCC 
TGTTACAGATAACTACATTAGGAATTCATTCTTAGCTTCTTCATCTTTGTGTGGATGTGTAT 
ACTTTACGCATCTTTCCTTTTGAGTAGAGAAATTATGTGTGTCATGTGGTCTTCTGAAAATG 
GAACACCATTCTTCAGAGCACACGTCTAGCCCTCAGCAAGACAGTTGTTTCTCCTCCTCCTT 
GCATATTTCCTACTGCGCTCCAGCCTGAGTGATAGAGTGAGACTCTGTCTCAAAAAAAAGTA 
TCTCTAAATACAGGATTATAATTTCTGCTTGAGTATGGTGTTAACTACCTTGTATTTAGAAA 
GATTTCAGATTCATTCCATCTCCTTAGTTTTCTTTTAAGGTGACCCATCTGTGATAAAAATA 
TAGCTTAGTGCTAAAATCAGTGTAACTTATACATGGCCTAAAATGTTTCTACAAATTAGAGT 
TTGTCACTTATTCCATTTGTACCTAAGAGAAAAATAGGCTCAGTTAGAAAAGGACTCCCTGG 
CCAGGCGCAGTGACTT ACGC CTGTAATCTCAGCACTTTGGGAGG CCAAGGCAGGCAGATCAC 
GAGGTCAGGAGTTCGAGACCATCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATAT 
AAAAATTAGCTGGGTGTGGTGGCAGGAGCCTGTAATCCCAGCTACACAGGAGGCTGAGGCAC 
GAGAATCACTTGAACTCAGGAGATGGAGGTTTCAGTGAGCCGAGATCACGCCACTGCACTCC 
AGCCTGGCAACAGAGCGAGACTCCATCTCAAAAAAAAAAAAAA 
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FIGURE 74 

MAARWRFWCVSVTMVVALLIVCDVPSASAQRKKEMVLSEKVSQLMEWTNKRPVIRMNGDKFR 
RLVKAPPRNYSVIVMFTALQLHRQCWCKQADEEFQIIJ^SWRYSSAFTNRIFFAMVDFDEG 
SDVFQMLNMNSAPTFINFPAKGKPKRGDTYELQVRGFSAEQIARWIADRTDVNIRVIRPPNY 
AGPLMLGLLLAVIGGLWLRRSNMEFLFNKTGWAFAALCFVLAMTSGQMWNHIRGPPYAHKN 
PHTGKVNYIHGSSQAQFVAETHIVLLFNGGVTLGMVLLCEAATSDMDIGKRKIMCVAGIGLV 
VLFFSWMLSIFRSKYHGYPYSFLMS 

Signal peptide: 

amino acids 1-29 

Transmembrane domains : 

amino acids 183-205, 217-237, 217-287, 301-321 
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FIGURE 75 

AAGCAACCAAACTGCAAGCTTTGGGAGTTGTTCGCTGTCCCTGCCCTGCTCTGCTAGGGAGA 
GAACGCCAGAGGGAGGCGGCTGGCCCGGCGGCAGGCTCTCAGAACCGCTACCGGCGATGCTA 
CTGCTGTGGGTGTCGGTGGTCGCAGCCTTGGCGCTGGCGGTACTGGCCCCCGGAGCAGGGGA 
GCAGAGGCGGAGAGCAGCCAAAGCGCCCAATGTGGTGCTGGTCGTGAGCGACTCCTTCGATG 
GAAGGTTAACATTTCATCCAGGAAGTCAGGTAGTGAAACTTCCTTTTATCAACTTTATGAAG 
ACACGTGGGACTTCCTTTCTGAATGCCTACACAAACTCTCCAATTTGTTGCCCATCACGCGC 
AGCAATGTGGAGTGGCCTCTTCACTCACTTAACAGAATCTTGGAATAATTTTAAGGGTCTAG 
ATCCAAATTATACAACATGGATGGATGTCATGGAGAGGCATGGCTACCGAACACAGAAATTT 
GGGAAACTGGACTATACTTCAGGACATCACTCCATTAGTAATCGTGTGGAAGCGTGGACAAG 
AGATGTTGCTTTCTTACTCAGACAAGAAGG CAGGCC CATGGTTAATCTTATCCGTAACAGGA 

ctaaagtcagagtgatggaaagggattggcagaatacagacaaagcagtaaactggttaaga 
aaggaagcaattaattacactgaaccatttgttatttacttgggattaaatttaccacaccc 
ttacccttcaccatcttctggagaaaattttggatcttcaacatttcacacatctctttatt 
ggcttgaaaaagtgtctcatgatgccatcaaaatcccaaagtggtcacctttgtcagaaatg 
caccctgtagattattactcttcttatacaaaaaactgcactggaagatttacaAaaaaaga 
aattaagaatattagagcattttattatgctatgtgtgctgagacagatgccatgcttggtg 

AAATTATTTTGGCCCTTCATCAATTAGATCTTCTTCAGAAAACTATTGTCATATACTCCTCA 
GACCATGGAGAGCTGGCCATGGAACATCGACAGTTTTATAAAATGAGCATGTACGAGGCTAG 
TGCACATGTTCCGCTTTTGATGATGGGACCAGGAATTAAAGCCGGCCTACAAGTATCAAATG 
TGGTTTCTCTTGTGGATATTTACCCTACCATGCTTGATATTGCTGGAATTCCTCTGCCTCAG 
AACCTGAGTGGATACTCTTTGTTGCCGTTATCATCAGAAACATTTAAGAATGAACATAAAGT 
CAAAAACCTGCATCCACCCTGGATTCTGAGTGAATTCCATGGATGTAATGTGAATGCCTCCA 
CCTACATGCTTCGAACTAACCACTGGAAATATATAGCCTATTCGGATGGTGCATCAATATTG 
CCTCAACTCTTTGATCTTTCCTCGGATCCAGATGAATTAACAAATGTTGCTGTAAAATTTCC 
AGAAATTACTTATTCTTTGGATCAGAAGCTTCATTCCATTATAAACTACCCTAAAGTTTCTG 
CTTCTGTCCACCAGTATAATAAAGAGCAGTTTATCAAGTGGAAACAAAGTATAGGACAGAAT 
TATTCAAACGTTATAGCAAATCTTAGGTGGCACCAAGACTGGCAGAAGGAACCAAGGAAGTA 
TGAAAATGCAATTGATCAGTGGCTTAAAACCCATATGAATCCAAGAGCAGTTTGAACAAAAA 
GTTTAAAAATAGTGTTCTAGAGATACATATAAATATATTACAAGATCATAATTATGTATTTT 
AAATGAAACAGTTTTAATAATTACCAAGTTTTGGCCGGGCACAGTGGCTCACACCTGTAATC 
CCAGGACTTTGGGAGGCTGAGGAAAGCAGATCACAAGGTCAAGAGATTGAGACCATCCTGGC 
CAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCTGGGCGCGGTGGTGCACA 
CCTATAGTCTCAGCTACTCAGAGGCTGAGGCAGGAGGATCGCTTGAACCCGGGAGGCAGCAG 
TTGCAGTGAGCTGAGATTGCGCCACTGTACTCCAGCCTGG CAACAGAGTGAGACTGTGTCG C 
AAAAAAATAAAAATAAAATAATAATAATTACCAATTTTTCATTATTTTGTAAGAATGTAGTG 
TATTTTAAGATAAAATGCCAATGATTATAAAATCACATATTTTCAAAAATGGTTATTATTTA 
GGCCTTTGTACAATTTCTAACAATTTAGTGGAAGTATCAAAAGGATTGAAGCAAATACTGTA 
ACAGTTATGTTCCTTTAAATAATAGAGAATATAAAATATTGTAATAATATGTATCATAAAAT 
AGTTGTATGTGAGCATTTGATGGTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 76 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA64885 
<subunit 1 of 1, 536 aa, 1 stop 
<MW: 61450, pi: 9.17, NX(S/T): 7 

MLLLWSWAALALAVLAPGAGEQRRRAAKAPNVVLWSDSFDGRLTFHPGSQWKLPFINF 
MKTRGTSFLNAYTNSPICCPSRAAIWSGLFTHLTESWNNFKGLDPNYTTWMDVMERHGYRTQ 
KFGKLDYTSGHHS I SNRVEAWTRDVAFLLRQEGRPMTOLIRNRTKVRVMERDWQNTDKAVNW 
LRKEAINYTEPFVIYLGLNLPHPYPSPSSGENFGSSTFHTSLYWLEKVSHDAIKIPKWSPLS 
EMHPVDYYSSYTKNCTGRFTKKEIKNIRAFYYAMCAETDAMLGEIILALHQLDLLQKTIVIY 
SSDHGELAMEHRQFYKMSMYEASAHVPLLMMGPGIKAGLQVSNVVSLVDIYPTMLDIAGIPL 
PQNLSGYSLLPLSSETFKNEHKVKNLHPPWILSEFHGCNVNASTYMLRTNHWKYIAYSDGAS 
ILPQLFDLSSDPDELTNVAVKFPEITYSLDQKLHSIINYPKVSASVHQYNKEQFIKWKQSIG 
QNYSNVIANLRWHQDWQKEPRKYENAIDQWLKTHMNPRAV 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-111, 166-169, 193-196, 262-265, 375-378, 413-416, 
498-501 

Sulfatases proteins: 

amino acids 286-315, 359-369, 78-97 
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FIGURE 77 

GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCAAG 
AGCTTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATG 
GCCTCTCTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACT 
GGTTGCCATGCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAG 
CAGTTGGCTTCTCCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAG 
TGTGACATCTATAGCACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGAT 
GGTGACATCCAGTGCAATCTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCA 
CAGTCTTCTGCCAGGAATCCCGAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTC 
ATCCTTGGAGGCCTCCTGGGATTCATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGA 
CTTCTACTCACCACTGGTGCCTGACAGCATGAAATTTGAGATTGGAGAGGCTCTTTACTTGG 
GCATTATTTCTTCCCTGTTCTCCCTGATAGCTGGAATCATCCTCTGCTTTTCCTGCTCATCC 
CAGAGAAATCGCTCCAACTACTACGATGCCTACCAAGCCCAACCTCTTGCCACAAGGAGCTC 
TCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGAGTTCAATTCCTACAGCCTGACAGGGT 
ATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTGGGTCTGTGAAAAACAGTGGACAG 
CACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGTGTCAGAAGGTGCTGCTGAGG 
ATAGACTGACTTTGGCCATTGGATTGAG CAAAGGCAGAAATGGGGG CT AGTGTAACAG CATG 
CAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCCTCACCTTGCTGCTC 
CCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCAGGACTCAGAGG 
ATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACATCCCACTG 
ACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATTGCTGG 
GGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTCCC 
TCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCC 
AGACTAATTTGTGCATGAACTGAAATAAAACCATCCTACGGTATCCAGGGAACAGAAAGCAG 
GATGCAGGATGGGAGGACAGGAAGGCAGCCTGGGACATTTAAAAAAATA 
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FTGTIRE 78 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA64886 
xsubunit 1 of 1, 230 aa, 1 stop 
><MW: 24549, pi: 8.56, NX(S/T): 1 

MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGIT 
QCDIYSTLLGLPADIQAAQAMMVTSSAISSLACIISWGMRCTVFCQESRAKDRVAVAGGVF 
FILGGLLGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYIiGIISSLFSLIAGIILCFSCS 
SQRNRSNYYDAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins, 
amino acids 46-59 
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FIGURE 79 

GCACTGCTGCTGTCCCATCAGCTGCTCTGAAGCTCCATGGTGCCCAGAATCTTCGCTCCTGC 
TTATGTGTCAGTCTGTCTCCTCCTCTTGTGTCCAAGGGAAGTCATCGCTCCCGCTGGCTCAG 
AACCATGGCTGTGCCAGCCGGCACCCAGGTGTGGAGACAAGATCTACAACCCCTTGGAGCAG 
TGCTGTTACAATGACGCCATCGTGTCCCTGAGCGAGACCCGCCAATGTGGTCCCCCCTGCAC 
CTTCTGGCCCTGCTTTGAGCTCTGCTGTCTTGATTCCTTTGGCCTCACAAACGATTTTGTTG 
TGAAGCTGAAGGTTCAGGGTGTGAATTCCCAGTGCCACTCATCTCCCATCTCCAGTAAATGT 
GAAAGCAGAAGACGTTTTCCCTGAGAAGACATAGAAAGAAAATCAACTTTCACTAAGGCATC 
TCAGAAACATAGGCTAAGGTAATATGTGTACCAGTAGAGAAGCCTGAGGAATTTACAAAATG 
ATGCAGCTCCAAGCCATTGTATGGCCCATGTGGGAGACTGATGGGACATGGAGAATGACAGT 
AGATTATCAGGAAATAAATAAAGTGGTTTTTCCAATGTACACACCTGTAAAA 
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FIGURE 80 

MVPRIFAPAYVSVCLLLLCPREVIAPAGSEPWLCQPAPRCGDKIYNPLEQCCYNDAIVSLSE 
TRQCGPPCTFWPCFELCCLDSFGLTNDFWKLKVQGVNSQCHSSPISSKCESRRRFP 

Signal peptide: 

amino acids 1-25 
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FIGURE 81 

CTCCACTGCAACCACCCAGAGCCATGGCTCCCCGAGGCTGCATCGTAGCTGTCTTTGCCATT 
TTCTGCATCTCCAGGCTCCTCTGCTCACACGGAGCCCCAGTGGCCCCCATGACTCCTTACCT 
GATGCTGTGCCAGCCACACAAGAGATGTGGGGACAAGTTCTACGACCCCCTGCAGCACTGTT 
GCTATGATGATGCCGTCGTGCCCTTGGCCAGGACCCAGACGTGTGGAAACTGCACCTTCAGA 
GTCTGCTTTGAGCAGTGCTGCCCCTGGACCTTCATGGTGAAGCTGATAAACCAGAACTGCGA 
CTCAGCCCGGACCTCGGATGACAGGCTTTGTCGCAGTGTCAGCTAATGGAACATCAGGGGAA 
CGATGACTCCTGGATTCTCCTTCCTGGGTGGGCCTGGAGAAAGAGGCTGGTGTTACCTGAGA 
TCTGGGATGCTGAGTGGCTGTTTGGGGGCCAGAGAAACACACACTCAACTGCCCACTTCATT 
CTGTGACCTGTCTGAGGCCCACCCTGCAGCTGCCCTGAGGAGGCCCACAGGTCCCCTTCTAG 
AATTCTGGACAGCATGAGATGCGTGTGCTGATGGGGGCCCAGGGACTCTGAACCCTCCTGAT 
GACCCCTATGGCCAACATCAACCCGGCACCACCCCAAGGCTGGCTGGGGAACCCTTCACCCT 
TCTGTGAGATTTTCCATCATCTCAAGTTCTCTTCTATCCAGGAGCAAAGCACAGGATCATAA 
TAAATTTATGTACTTTATAAATGAAAA 



SI 
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FIGURE 82 

MAPRGCIVAVFAIFCISRLLCSHGAPVAPMTPYLMLCQPHKRCGDKFYDPLQHCCYDDAWP 
LARTQTCGNCTFRVCFEQCCPWTFMVKLINQNCDSARTSDDRLCRSVS 

Signal peptide: 

amino acids 1-24 
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FTGIJRE 83 

GGGGGCGGGTGCCTGGAGCACGGCGCTGGGGCCGCCCGCAGCGCTCACTCGCTCGCACTCAG 
TCGCGGGAGGCTTCCCCGCGCCGGCCGCGTCCCGCCCGCTCCCCGGCACCAGAAGTTCCTCT 
GCGCGTCCGACGGCGACATGGGCGTCCCCACGGCCCTGGAGGCCGGCAGCTGGCGCTGGGGA 
TCCCTGCTCTTCGCTCTCTTCCTGGCTGCGTCCCTAGGTCCGGTGGCAGCCTTCAAGGTCGC 
CACGCCGTATTCCCTGTATGTCTGTCCCGAGGGGCAGAACGTCACCCTCACCTGCAGGCTCT 
TGGGCCCTGTGGACAAAGGGCACGATGTGACCTTCTACAAGACGTGGTACCGCAGCTCGAGG 
GGCGAGGTGCAGACCTGCTCAGAGCGCCGGCCCATCCGCAACCTCACGTTCCAGGACCTTCA 
CCTGCACCATGGAGGCCACCAGGCTGCCAACACCAGCCACGACCTGGCTCAGCGCCACGGGC 
TGGAGTCGGCCTCCGACCACCATGGCAACTTCTCCATCACCATGCGCAACCTGACCCTGCTG 
GATAGCGGCCTCTACTGCTGCCTGGTGGTGGAGATCAGGCACCACCACTCGGAGCACAGGGT 
CCATGGTGCCATGGAGCTGCAGGTGCAGACAGGCAAAGATGCACCATCCAACTGTGTGGTGT 
ACCCATCCTCCTCCCAGGATAGTGAAAACATCACGGCTGCAGCCCTGGCTACGGGTGCCTGC 
ATCGTAGGAATCCTCTGCCTCCCCCTCATCCTGCTCCTGGTCTACAAGCAAAGGCAGGCAGC 
CTCCAACCGCCGTGCCCAGGAGCTGGTGCGGATGGACAGCAACATTCAAGGGATTGAAAACC 
CCGGCTTTGAAGCCTCACCACCTGCCCAGGGGATACCCGAGGCCAAAGTCAGGCACCCCCTG 
TCCTATGTGGCCCAGCGGCAGCCTTCTGAGTCTGGGCGGCATCTGCTTTCGGAGCCCAGCAC 
CCCCCTGTCTCCTCCAGGCCCCGGAGACGTCTTCTTCCCATCCCTGGACCCTGTCCCTGACT 
CTCCAAACTTTGAGGTCATC TAG CCCAGCTGGGGGACAGTGGGCTGTTGTGGCTGGGTCTGG 
GGCAGGTGCATTTGAGCCAGGGCTGGCTCTGTGAGTGGCCTCCTTGGCCTCGGCCCTGGTTC 
CCTCCCTCCTGCTCTGGGCTCAGATACTGTGACATCCCAGAAGCCCAGCCCCTCAACCCCTC 
TGGATGCTACATGGGGATGCTGGACGGCTCAGCCCCTGTTCCAAGGATTTTGGGGTGCTGAG 
ATTCTCCCCTAGAGACCTGAAATTCACCAGCTACAGATGCCAAATGACTTACATCTTAAGAA 
GTCTCAGAACGTCCAGCCCTTCAGCAGCTCTCGTTCTGAGACATGAGCCTTGGGATGTGGCA 
GCATCAGTGGGACAAGATGGACACTGGGCCACCCTCCCAGGCACCAGACACAGGGCACGGTG 
GAGAGACTTCTCCCCCGTGGCCGCCTTGGCTCCCCCGTTTTGCCCGAGGCTGCTCTTCTGTC 
AGACTTCCTCTTTGTACCACAGTGGCTCTGGGGCCAGGCCTGCCTGCCCACTGGCCATCGCC 
ACCTTCCCCAGCTGCCTCCTACCAGCAGTTTCTCTGAAGATCTGTCAACAGGTTAAGTCAAT 
CTGGGGCTTCCACTGCCTGCATTCCAGTCCCCAGAGCTTGGTGGTCCCGAAACGGGAAGTAC 
ATATTGGGGCATGGTGGCCTCCGTGAGCAAATGGTGTCTTGGGCAATCTGAGGCCAGGACAG 
ATGTTGCCCCACCCACTGGAGATGGTGCTGAGGGAGGTGGGTGGGGCCTTCTGGGAAGGTGA 
GTGGAGAGGGGCACCTGCCCCCCGCCCTCCCCATCCCCTACTCCCACTGCTCAGCGCGGGCC 
ATTGCAAGGGTGCCACACAATGTCTTGTCCACCCTGGGACACTTCTGAGTATGAAGCGGGAT 
GCTATTAAAAACTACATGGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGA 
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FTGURE 84 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA64897 
xsubunit 1 of 1, 311 aa, 1 stop 
><MW: 33908, pi: 6.87, NX(S/T): 6 

MGVPTALEAGSWRWGSLLFALFLAASLGPVAAFKVATPYSLYVCPEGQNVTLTCRLLGPVDK 
GHDVTFYKTWYRSSRGEVQTCS ERRP IRNLTFQDLHLHHGGHQAANTSHDLAQRHGLESASD 
HHGNFS ITMRNLTLLDSGLYCCLWE IRHHHSEHRVHGAMELQVQTGKDAPSNCWYPSSSQ 
DSENITAAALATGACIVGILCLPLILLLVYKQRQAASNRRAQELVRMDSNIQGIENPGFEAS 
PPAQGIPEAKVRHPLSYVAQRQPSESGRHLLSEPSTPLSPPGPGDVFFPSLDPVPDSPNFEVI 

Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 190-216 
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FIGURE 85 

CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCT 
TTCCCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCC 
TTTCCTGCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGG 
GGTCTGTGGGTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGA 
CTCCGCTCCCGGACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTC 
TCCTCCTTGCTGGGACTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCC 
AGACATGTTCTGCCTTTTCCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACT 
TGGAGCCACAAGGCCTGATGTACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGT 
TGTTACCGCCTCCACTGTCCGCCTGTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATG 
CTGTCCCAAGTGTGTGGAACCTCACACTCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCC 
AGCACAACGGGACCATGTACCAACACGGAGAGATCTTCAGTGCCCATGAGCTGTTCCCCTCC 
CGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGCACAGAGGGCCAGATCTACTGCGGCCTCAC 
AACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCCACTGCCAGACTCCTGCTGCCAAGCCT 
GCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACAGTGTGCAGTCGCTCCATGGGGTG 
AGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAGAGAGGCCCGGGCACCCCAGC 
CCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTTCAGACCCAAGGGAGCAG 
GCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCTGTGTGCATGGCGGG 
AAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGCCCCTTGCCCTG 
CATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCCCACCGAGT 
ACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGGACAAA 
GCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTCCT 
CGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGG 
CCTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAG 
AGAGGTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGA 
AAGTCAGGAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCAC 
GAAGGTCACTGGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGA 
CAAAGTGACCAAGACATAACAAAGACCTAACAGTTGCAGATATGAGCTGTATAATTGTTGTT 
ATTATATATTAATAAATAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 86 

>< /usr /seqdb2 /s st /DNA/Dnaseqs . min/ss . DNA64 9 0 2 
xsubunit 1 of 1, 451 aa, 1 stop 
><MW: 49675, pi: 7.15, NX{S/T): 1 

MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCT 
CSEGAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIF 
SAHELFPSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEED 
SVQSLHGVRHPQDPCSSDAGRKRGPGT.PAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKH 
KiCACVHGGKTYSHGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKC 
CKICPEDKADPGHSEISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLV 
KDEETEAQRGEVPGPRPHSQNLPLDSDQESQEARLPERGTALPTARWPPRRSLE?IiPSPDPG 
AEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 87 

CTAGCCTGCGCCAAGGGGTAGTGAGACCGCGCGGCAACAGCTTGCGGCTGCGGGGAGCTCCC 
GTGGGCGCTCCGCTGGCTGTGCAGGCGGCCATGGATTCCTTGCGGAAAATGCTGATCTCAGT 
CGCAATGCTGGGCGCAGGGGCTGGCGTGGGCTACGCGCTCCTCGTTATCGTGACCCCGGGAG 
AGCGGCGGAAGCAGGAAATGCTAAAGGAGATGCCACTGCAGGACCCAAGGAGCAGGGAGGAG 
GCGGCCAGGACCCAGCAGCTATTGCTGGCCACTCTGCAGGAGGCAGCGACCACGCAGGAGAA 
CGTGGCCTGGAGGAAGAACTGGATGGTTGGCGGCGAAGGCGGCGCCAGCGGGAGGTCACCGT 
GAGACCGGACTTGCCTCCGTGGGCGCCGGACCTTGGCTTGGGCGCAGGAATCCGAGGCAGCC 
TTTCTCCTTCGTGGGCCCAGCGGAGAGTCCGGACCGAGATACCATGCCAGGACTCTCCGGGG 
TCCTGTGAGCTGCCGTCGGGTGAGCACGTTTCCCCCAAACCCTGGACTGACTGCTTTAAGGT 
CCGCAAGGCGGGCCAGGGCCGAGACGCGAGTCGGATGTGGTGAACTGAAAGAACCAATAAAA 
TCATGTTCCTCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAA 
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FTGTJRE 88 

MDSLRKMLISVAMLGAGAGVGYALLVIVTPGERRKQEMLKEMPLQDPRSREEAARTQQLLLA 
TLQEAATTQENVAWRKNWMVGGEGGASGRS P 

Signal peptide: 
amino acids 1-18 
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FIGURE 89 

CAGGAGAGAAGGCACCGCCCCCACCCCGCCTCCAAAGCTAACCCTCGGGCTTGAGGGGAAGA 
GGCTGACTGTACGTTCCTTCTACTCTGGCACCACTCTCCAGGCTGCCATGGGGCCCAGCACC 
CCTCTCCTCATCTTGTTCCTTTTGTCATGGTCGGGACCCCTCCAAGGACAGCAGCACCACCT 
TGTGGAGTACATGGAACGCCGACTAGCTGCTTTAGAGGAACGGCTGGCCCAGTGCCAGGACC 
AGAGTAGTCGGCATGCTGCTGAGCTGCGGGACTTCAAGAACAAGATGCTGCCACTGCTGGAG 
GTGGCAGAGAAGGAGCGGGAGGCACTCAGAACTGAGGCCGACACCAXCTCCGGGAGAGTGGA 
TCGTCTGGAGCGGGAGGTAGACTATCTGGAGACCCAGAACCCAGCTCTGCCCTGTGTAGAGT 
TTGATGAGAAGGTGACTGGAGGCCCTGGGACCAAAGGCAAGGGAAGAAGGAATGAGAAGTAC 
GATATGGTGACAGACTGTGGCTACACAATCTCTCAAGTGAGATCAATGAAGATTCTGAAGCG 
ATTTGGTGGCCCAGCTGGTCTATGGACCAAGGATCCACTGGGGCAAACAGAGAAGATCTACG 
TGTTAGATGGGACACAGAATGACACAGCCTTTGTCTTCCCAAGGCTGCGTGACTTCACCCTT 
GCCATGGCTGCCCGGAAAGCTTCCCGAGTCCGGGTGCCCTTCCCCTGGGTAGGCACAGGGCA 
GCTGGTATATGGTGGCTTTCTTTATTTTGCTCGGAGGGCTCCTGGAAGACCTGGTGGAGGTG 
GTGAGATGGAGAACACTTTGCAGCTAATCAAATTCCACCTGGCAAACCGAACAGTGGTGGAC 
AGCTCAGTATTCCCAGCAGAGGGGCTGATCCCCCCCTACGGCTTGACAGCAGACACCTACAT 
CGACCTGGTAGCTGATGAGGAAGGTCTTTGGGCTGTCTATGCCACCCGGGAGGATGACAGGC 
ACTTGTGTCTGG CCAAGTTAGATC CACAGACACTGGACACAG AGCAGCAGTGGGAGACACCA 
TGTCCCAGAGAGAATGCTGAGGCTGCCTTTGTCATCTGTGGGACCCTCTATGTCGTCTATAA 
CACCCGTCCTGCCAGTCGGGCCCGCATCCAGTGCTCCTTTGATGCCAGCGGCACCCTGACCC 
CTGAACGGGCAGCACTCCCTTATTTTCCCCGCAGATATGGTGCCCATGCCAGCCTCCGCTAT 
AACCCCCGAGAACGCCAGCTCTATGCCTGGGATGATGGCTACCAGATTGTCTATAAGCTGGA 
GATGAGGAAGAAAGAGGAGGAGGTTTGAGGAGCTAGCCTTGTTTTTTGCATCTTTCTCACTC 
CCATACATTTATATTATATCCCCACTAAATTTCTTGTTCCTCATTCTTCAAATGTGGGCCAG 
TTGTGGCTCAAATCCTCTATATTTTTAGCCAATGGCAATCAAATTCTTTCAGCTCCTTTGTT 
TCATACGGAACTCCAGATCCTGAGTAATCCTTTTAGAGCCCGAAGAGTCAAAACCCTCAATG 
TTCCCTCCTGCTCTCCTGCCCCATGTCAACAAATTTCAGGCTAAGGATGCCCCAGACCCAGG 
GCTCTAACCTTGTATGCGGGCAGGCCCAGGGAGCAGGCAGCAGTGTTCTTCCCCTCAGAGTG 
ACTTGGGGAGGGAGAAATAGGAGGAGACGTCCAGCTCTGTCCTCTCTTCCTCACTCCTCCCT 
TCAGTGTCCTGAGGAACAGGACTTTCTCCACATTGTTTTGTATTGCAACATTTTGCATTAAA 
AGGAAAATCCACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
AAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 90 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA64905 
<subunit 1 of 1, 406 aa, 1 stop 
<MW: 46038, pi: 6.50, NX(S/T): 2 

MGPSTPLLILFLLSWSGPLQGQQHHLVEYMERRLAALEERLAQCQDQSSRHAAELRDFKNKM 
LPLLEVAEKEREALRTEADTISGRVDRLEREVDYLETQNPALPCVEFDEKVTGGPGTKGKGR 
RNEKYDMVTDCGYTISQVRSMKILiKRFGGPAGLWTKDPLGQTEKIYVLDGTQNDTAFVFPRL 
RDFTLAMAARKASRVRVPFPWVGTGQLVYGGFLYFARRPPGRPGGGGEMENTLQLIKFHLAN 
RTWDSSVFPAEGLIPPYGLTADTYIDLVADEEGLWAVYATREDDRHLCLAKLDPQTLDTEQ 
QWDTPCPRENAEAAFVICGTLYWYNTRPASRARIQCSFDASGTLTPERAALPYFPRRYGAH 
ASLRYNPRERQLYAWDDGYQIVYKLEMRKKEEEV 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 177-180, 248-251 
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FIGURE 91 

gacagctgtgtctcgatggagtagactctcagaacagcgcagtttgccctccgctcacgcag 
agcctctccgtggcttccgcaccttgagcattaggccagttctcctcttctctctaatccat 
ccgtcacctctcctgtcatccgtttccatgccgtgaggtccattcacagaacacatccatgg 
ctctcatgctcagtttggttctgagtctcctcaagctgggatcagggcagtggcaggtgttt 
gggccagacaagcctgtccaggccttggtgggggaggacgcagcattctcctgtttcctgtc 
tcctaagaccaatgcagaggccatggaagtgcggttcttcaggggccagttctctagcgtgg 
tccacctctacagggacgggaaggaccagccatttatgcagatgccacagtatcaaggcagg 
acaaaactggtgaaggattctattgcggaggggcgcatctctctgaggctggaaaacattac 
tgtgttggatgctggcctctatgggtgcaggattagttcccagtcttactaccagaaggcca 
tctgggagctacaggtgtcagcactgggctcagttcctctcatttccatcacgggatatgtt 
gatagagacatccagctactctgtcagtcctcgggctggttcccccggcccacagcgaagtg 
gaaaggtccacaaggacaggatttgtccacagactccaggacaaacagagacatgcatggcc 
tgtttgatgtggagatctctctgaccgtccaagagaacgccgggagcatatcctgttccatg 
cggcatgctcatctgagccgagaggtggaatccagggtacagataggagatacctttttcga 
gcctatatcgtggcacctggctaccaaagtactgggaatactctgctgtggcctAttttttg 
gcattgttggactgaagattttcttctccaaattccagtggaaaatccaggcggaactggac 
tggagaagaaagcacggacaggcagaattgagagacgcccggaaacacgcagtggaggtgac 
tctggatccagagacggctcacccgaagctctgcgtttctgatctgaaaactgtaacccata 
gaaaagctccccaggaggtgcctcactctgagaagagatttacaaggaagagtgtggtggct 
tct cagagtttc caagcagggaaacattactgggaggtgg acggaggacacaataaaaggtg 
gcgcgtgggagtgtgccgggatgatgtggacaggaggaaggagtacgtgactttgtctcccg 
atcatgggtactgggtcctcagactgaatggagaacatttgtatttcacattaaatccccgt 
tttatcagcgtcttccccaggaccccacctacaaaaataggggtcttcctggactatgagtg 
tgggaccatctccttcttcaacataaatgaccagtcccttatttataccctgacatgtcggt 
ttgaaggcttattgaggccctacattgagtatccgtcctataatgagcaaaatggaactccc 
atagtcatctgcccagtcacccaggaatcagagaaagaggcctcttggcaaagggcctctgc 

AATCCCAGAGACAAGCAACAGTGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGG 
GTGAAATGTAGGATGAATCACATCCCACATTCTTCTTTAGGGATATTAAGGTCTCTCTCCCA 
GATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCAGATGAAGGGGGACTGGCCTGTCC 
ACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGGCTGACATTACATTT 
AGTTTGCTCTCACTCCATCTGGCTAAGTGATCTTGAAATACCACCTCTCAGGTGAAGAACCG 
TCAGGAATTC CCATCT CACAGGCTGTGGTGTAGATTAAGTAGACAAGG AATGTGAATAATGC 

TTAGATCTTATTGATGACAGAGTGTATCCTAATGGTTTGTTCATTATATTACACTTTCAGTA 
AAAAAA 
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FIGURE 92 

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSS 
WHLYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQK 
AIWELQVSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMH 
GLFDVE I SLTVQENAGS I SCSMRHAHLSREVE SRVQ IGDTFFEP I SWHLATKVLG ILCCGLF 
FGIVGLKIFFSKFQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVT 
HRKAPQEVPHSEKRFTRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLS 
PDHGYWVLRLNGEHLYFTLNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTC 
RFEGLLRPYIEYPSYNEQNGTPIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLP 
RGEM 

Signal peptide: 
amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 93 

GCGATGGTGCGCCCGGTGGCGGTGGCGGCGGCGGTTGCGGAGGCTTCCTTGGTCGGATTGCA 

ACGAGGAGAAGATGACTGACCAACCGACTGGCTGAATGAATGAATGGCGGAGCCGAGCGCGC 

CATGAGGAGCCTGCCGAGCCTGGGCGGCCTCGCCCTGTTGTGCTGCGCCGCCGCCGCCGCCG 

CCGTCGCCTCAGCCGCCTCGGCGGGGAATGTCACCGGTGGCGGCGGGGCCGCGGGGCAGGTG 

GACGCGTCGCCGGGCCCCGGGTTGCGGGGCGAGCCCAGCCACCCCTTCCCTAGGGCGACGGC 

TCCCACGGCCCAGGCCCCGAGGACCGGGCCCCCGCGCGCCACCGTCCACCGACCCCTGGCTG 

CGACTTCTCCAGCCCAGTCCCCGGAGACCACCCCTCTTTGGGCGACTGCTGGACCCTCTTCC 

ACCACCTTTCAGGCGCCGCTCGGCCCCTCGCCGACCACCCCTCCGGCGGCGGAACGCACTTC 

GACCACCTCTCAGGCGCCGACCAGACCCGCGCCGACCACCCTTTCGACGACCACTGGCCCGG 

CGCCGACCACCCCTGTAGCGACCACCGTACCGGCGCCCACGACTCCCCGGACCCCGACCCCC 

GATCTCCCCAGCAGCAGCAACAGCAGCGTCCTCCCCACCCCACCTGCCACCGAGGCCCCCTC 

TTCGCCTCCTCCAGAGTATGTATGTAACTGCTCTGTGGTTGGAAGCCTGAATGTGAATCGCT 

GCAACCAGACCACAGGGCAGTGTGAGTGTCGGCCAGGTTATCAGGGGCTTCACTGTGAAACC 

TGCAAAGAGGGCTTTTACCTAAATTACACTTCTGGGCTCTGTCAGCCATGTGACTGTAGTCC 

ACATGGAGCTCTCAGCATACCGTGCAACAGGTAAGCAACAGAGGGTGGAACTGAAGTTTATT 

TTATTTTAGCAAGGGAAAAAAAAAGGCTGCTACTCTCAAGGACCATACTGGTTTAAACAAAG 

GAGGATGAGGGTCATAGATTTACAAAATATTTTATATACTTTTATTCTCTTACTTTATATGT 

TATATTTAATGTCAGGATTTAAAAACATCTAATTTACTGATTTAGTTCTTCAAAAGCACTAG 

AGTCGCCAATTTTTCTCTGGGATAATTTCTGTAAATTTCATGGGAAAAAATTATTGAAGAAT 

AAATCTGCTTTCTGGAAGGGCTTTCAGGCATGAAACCTGCTAGGAGGTTTAGAAATGTTCTT 

ATGTTTATTAATATACCATTGGAGTTTGAGGAAATTTGTTGTTTGGTTTATTTTTCTCTCTA 

ATCAAAATTCTACATTTGTTTCTTTGGACATCTAAAGCTTAACCTGGGGGTACCCTAATTTA 

TTTAACTAGTGGTAAGTAGACTGGTTTTACTCTATTTACCAGTACATTTTTGAGACCAAAAG 

TAGATTAAGCAGGAATTATCTTTAAACTATTATGTTATTTGGAGGTAATTTAATCTAGTGGA 

ATAATGTACTGTTATCTAAGCATTTGCCTTGTACTGCACTGAAAGTAATTATTCTTTGACCT 

TATGTGAGGCACTTGGCTTTTTGTGGACCCCAAGTCAAAAAACTGAAGAGACAGTATTAAAT 

AATGAAAAAAATAATGACAGGTTATACTCAGTGTAACCTGGGTATAACCCAAGATCTGCTGC 

CACTTACGAGCTGTGTTCCTTGGGCAAGTAATTTCCTTTCACTGAGCTTGTTTCTTCTCAAG 

GTTGTTGTGAAGATTAAATGAGTTGATATATATAAAATGCCTAGCACATGTCACTCAATAAA 

TTCTGGTTTGTTTTAATTTCAAAGGAATATTATGGACTGAAATGAGAGAACATGTTTTAAGA 

ACTTTTAGCTCCTTGACAAAGAAGTGCTTTATACTTTAGCACTAAATATTTTAAATGCTTTA 

TAAATGATATTATACTGTTATGGAATATTGTATCATATTGTAGTTTATTAAAAATGTAGAAG 

AGGCTGGGCGCGGTGGCTCACGCCTGTAATCCTAGCACTTTGGGAGGCCAAGGCGGGTGGAT 

CACTTGAGGCCAGGAGTTCTAGATGAGCCTGGCCAGCACAGTGAAACCCCGTCTCTACTAAA 

AATACAAACAAATTAGCTGGGCGTGGTGGCACACACCTGTAGTCCCAGCTACTCGGGAGGCT 

GAGGCAGGAGAATCGGTTGAACCCGGGAGGTGGAGGTTGCAGTGAGCTGAGATCGCGCCACT 

GCACTCCAGCCTGGTGAGAGAGGGAGACTCTGTCTTAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FTGIJRE 94 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA64952 
xsubunit 1 of 1, 258 aa, 1 stop 
><MW: 25716, pi: 8.13, NX(S/T): 5 

MRSLPSLGGLALLCCAAAAAAVASAASAGNVTGGGGAAGQVDASPGPGLRGEPSHPFPRATA 
PTAQAPRTGPPRATVHRPLAATSPAQSPETTPLWATAGPSSTTFQAPLGPSPTTPPAAERTS 
TTSQAPTRPAPTTLSTTTGPAPTTPVATTVPAPTTPRTPTPDLPSSSNSSVLPTPPATEAPS 
SPPPEYVCNCSWGSLNVNRCNQTTGQCECRPGYQGLHCETCKEGPYLNYTSGLCQPCDCSP 

HGALSIPCNR 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites . 

amino acids 30-33, 172-175, 195-198, 208-211, 235-238 

EGF-Xike domain cysteine pattern signature. 

amino acids 214-226. 
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FTGIJRE 95 

TGCGGCGCAGTGTAGACCTGGGAGGATGGGCGGCCTGCTGCTGGCTGCTTTTCTGGCTTTGG 
TCTCGGTGCCCAGGGCCCAGGCCGTGTGGTTGGGAAGACTGGACCCTGAGCAGCTTCTTGGG 
CCCTGGTACGTGCTTGCGGTGGCCTCCCGGGAAAAGGGCTTTGCCATGGAGAAGGACATGAA 
GAACGTCGTGGGGGTGGTGGTGACCCTCACTCCAGAAAACAACCTGCGGACGCTGTCCTCTC 
AGCACGGGCTGGGAGGGTGTGACCAGAGTGTCATGGACCTGATAAAGCGAAACTCCGGATGG 
GTGTTTGAGAATCCCTCAATAGGCGTGCTGGAGCTCTGGGTGCTGGCCACCAACTTCAGAGA 
CTATGCCATCATCTTCACTCAGCTGGAGTTCGGGGACGAGCCCTTCAACACCGTGGAGCTGT 
ACAGTCTGACGGAGACAGCCAGCCAGGAGGCCATGGGGCTCTTCACCAAGTGGAGCAGGAGC 
CTGGGCTTCCTGTCACAGTAGCAGGCCCAGCTGCAGAAGGACCTCACCTGTGCTCACAAGAT 
CCTTCTGTGAGTGCTGCGTCCCCAGTAGGGATGGCGCCCACAGGGTCCTGTGACCTCGGCCA 
GTGTCCACCCACCTCGCTCAGCGGCTCCCGGGGCCCAGCACCAGCTCAGAATAAAGCGATTC 
CACAGCA 
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FIGURE 96 

MGGLLLAAFLALVSVPRAQAWLGRLDPEQLLGPWYVLAV^ 

LTPENNLRTLSSQHGLGGCDQSVMDLIKRNSGWVFENPSIGVLELWVLATNFRDYAIIFTQL 
EFGDEPFNTVELYSLTETASQEAMGLFTKWSRSLGFLSQ 

Signal peptide: 

amino acids 1-20 
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FIGURE 97 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCC 
CCTGCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTT 
CCGTGACGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCAT 
GGCTGGATTTACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGA 
CCAGGATGCTCCAGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACC 
GATTCCACCTCCTTGGGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGA 
AGAAGTGATG CGGGGAGATACTTCTT TCGTATGGAGAAAGGAAGTATAAAATGGAATTATAA 
ACATCACCGGCTCTCTGTGAATGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAG 
GCACCCTGGAGTCCGGCTGCCCCCAGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAG 
GGGACACCCCCTATGATCTCCTGGATAGGGACCTCCGTGTCCCCCCTGGACCCCTCCACCAC 
CCGCTCCTCGGTGCTCACCCTCATCCCACAGCCCCAGGACCATGGCACCAGCCTCACCTGTC 
AGGTGACCTTCCCTGGGGCCAGCGTGACCACGAACAAGACCGTCCATCTCAACGTGTCCTAC 
CCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGAGACGGCACAGTATCCACAGTCTTGGG 
AAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCTGCGCCTGGTCTGTGCAGTTGATG 
CAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGAGAGGCCTGACCCTGTGCCCC 
TCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCACCTGAGGGATGCAGCTGA 
ATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCTGAACGTCTCCCTGC 
AGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAGCCACAGCCCTG 
GTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAATCGGCAAG 
GCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTCAGCCT 
CTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAGCT 
TCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGT 
GAAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCC 
AC AGATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAG C CC CTCCAGGCAAGGGA 
GAAGTCAGAGGCTGATTCTTGTAGAATTAACAGCCCTCAACGTGATGAGCTATGATAACACT 
ATGAATTATGTGCAGAGTGAAAAGCACACAGG CTTTAGAGTCAAAGTATCTCAAAC CTGAAT 
CCACACTGTGCCCTCCCTTTTATTTTTTTAACTAAAAGACAGACAAATTCCTA 
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FIGURE 98 

MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWF 
REGANTDQDAPVATNNPARAVWEETRDRPHLLGDPHTKNCTLS I RDARRSDAGRYFFRMEKG 
SIKWNYKHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVS 
PLDPSTTRSSVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDG 
TVSTVLGNGSSLSLPEGQSIiRLVCAVDAVDSNPPARLSLSWRGIiTLCPSQPSNPGVLELPWV 
HLRDAAEFTCRAQNPLGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVWRS 
CRKKSARPAAGVGDTGIEDANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYA 
SLSFQMVKPWDSRGQEATDTEYSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 99 

GACGCCCAGTGACCTGCCGAGGTCGGCAGCACAGAGCTCTGGAGATGAAGACCCTGTTCCTG 
GGTGTCACGCTCGGCCTGGCCGCTGCCCTGTCCTTCACCCTGGAGGAGGAGGATATCACAGG 
GACCTGGTACGTGAAGGCCATGGTGGTCGATAAGGACTTTCCGGAGGACAGGAGGCCCAGGA 
AGGTGTCCCCAGTGAAGGTGACAGCCCTGGGCGGTGGGAAGTTGGAAGCCACGTTCACCTTC 
ATGAGGGAGGATCGGTGCATCCAGAAGAAAATCCTGATGCGGAAGACGGAGGAGCCTGGCAA 
ATACAGCGCCTATGGGGGCAGGAAGCTCATGTAC CTGCAGGAGCTG CCCAGGAGGGACCACT 
ACATCTTTTACTGCAAAGACCAGCACCATGGGGGCCTGCTCCACATGGGAAAGCTTGTGGGT 
AGGAATTCTGATACCAACCGGGAGGCCCTGGAAGAATTTAAGAAATTGGTGCAGCGCAAGGG 
ACTCTCGGAGGAGGACATTTTCACGCCCCTGCAGACGGGAAGCTGCGTTCCCGAACAC TAG G 
CAGCCCCCGGGTCTGCACCTCCAGAGCCCACCCTACCACCAGACACAGAGCCCGGACCACCT 
GGACCTACCCTCCAGCCATGACCCTTCCCTGCTCCCACCCACCTGACTCCAAATAAAGTCCT 
TTTCCCCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 100 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA65404 
<subunit 1 of l, 170 aa, 1 stop 
<MW: 19457, pi: 9.10, NX(S/T): 0 

MKTLPLGVTLGLAAALSFTLEEEDITGTWYVKAMWDKDFPEDRRPRKVSPVKVTALGGGKL 
EATFTPMREDRCIQKKILMRKTEEPGKYSAYGGRKLMYLQELPRRDHYIFYCKDQHHGGLLH 
MGKLVGRNSDTNREALEEFKKLVQRKGLSEEDIFTPLQTGSCVPEH 

Important features: 
Signal peptide: 

amino acids 1-17 
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FIGURE 101 

GTTCCGCAGATGCAGAGGTTGAGGTGGCTGCGGGACTGGAAGTCATCGGGCAGAGGTCTCAC 
AGCAGCCAAGGAACCTGGGGCCCGCTCCTCCCCCCTCCAGGCCATGAGGATTCTGCAGTTAA 
TCCTGCTTGCTCTGGCAACAGGGCTTGTAGGGGGAGAGACCAGGATCATCAAGGGGTTCGAG 
TGCAAGCCTCACTCCC^GCCCTGGCAGGCAGCCCTGTTCGAGAAGACGCGGCTACTCTGTGG 
GGCGACGCTCATCGCCCCCAGATGGCTCCTGACAGCAGCCCACTGCCTCAAGCCCCGCTACA 
TAGTTCACCTGGGGCAGCACAACCTCCAGAAGGAGGAGGGCTGTGAGCAGACCCGGACAGCC 
ACTGAGTCCTTCCCCCACCCCGGCTTCAACAACAGCCTCCCCAACAAAGACCACCGCAATGA 
CATCATGCTGGTGAAGATGGCATCGCCAGTCTCCATCACCTGGGCTGTGCGACCCCTCACCC 
TCTCCTCACGCTGTGTCACTGCTGGCACCAGCTGCCTCATTTCCGGCTGGGGCAGCACGTCC 
AGCCCCCAGTTACGCCTGCCTCACACCTTGCGATGCGCCAACATCACCATCATTGAGCACCA 
GAAGTGTGAGAACGCCTACCCCGGCAACATCACAGACACCATGGTGTGTGCCAGCGTGCAGG 
AAGGGGGCAAGGACTCCTGCCAGGGTGACTCCGGGGGCCCTCTGGTCTGTAACCAGTCTCTT 
CAAGGCATTATCTCCTGGGGCCAGGATCCGTGTGCGATCACCCGAAAGCCTGGTGTCTACAC 
GAT^AGTCTGCAAATATGTGGACTGGATCCAGGAGACGATGAAGAACAATTAGACTGGACCCA 
CCCACCACAGCCCATCACCCTCCATTTCCACTTGGTGTTTGGTTCCTGTTCACTCTGTTAAT 
AAGAAACCCTAAGCCAAGACCCTCTACGAACATTCTTTGGGCCTCCTGGACTACAGGAGATG 
CTGTCACTTAATAATCAACCTGGGGTTCGAAATCAGTGAGACCTGGATTCAAATTCTGCCTT 
GAAATATTGTGACTCTGGGAATGACAACACCTGGTTTGTTCTCTGTTGTATCCCCAGCCCCA 
AAGACAGCTCCTGGCCATATATCAAGGTTTCAATAAATATTTGCTAAATGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAA 
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FTGTJRE 102 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA65405 
oubunit 1 of 1, 250 aa, 1 stop 
<MW: 27466, pi: 8.87, NX(S/T) :. 4 

MRILQLILLALATGLVGGETRIIKGFECKPHSQPWQAALFEKTRLLCGATLIAPRWLLTAAH 
CLKPRYIVHLGQHNLQKEEGCEQTRTATESFPHPGFNNSLPNKDHRNDIMLVKMASPVSITW 
AVRPLTLSSRCVTAGTSCLISGWGSTSSPQLRLPHTLRCANITIIEHQKCENAYPGNITDTM 
VCASVQEGGKDSCQGDSGGPLVCNQSLQGI I SWGQDPCAITRKPGVYTKVCKYVDW IQETMKNN 

Important features : 
Signal peptide: 

amino acids 1-18 

Serine proteases, trypsin family, histidine active site, 
amino acids 58-63 

N-glycosylation sites. 

amino acids 99-102, 165-168, 181-184, 210-213 

Glycosaminoglycan attachment site. 

amino acids 145-148 

Kringle domain proteins, 
amino acids 197-209, 47-64 

Serine proteases, trypsin family, histidine protein 

amino acids 199-209, 47-63, 220-243 

Apple domain proteins 

amino acids 222-249, 189-222 
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FIGURE 103 

GAGCAGTGTTCTGCTGGAGCCGATGCCAAAAACCATGCATTTCTTATTCAGATTCATTGTTT 
TCTTTTATCTGTGGGGCCTTTTTACTGCTCAGAGACAAAAGAAAGAGGAGAGCACCGAAGAA 
GTGAAAATAGAAGTTTTGCATCGTCCAGAAAACTGCTCTAAGACAAGCAAGAAGGGAGACCT 
ACTAAATGCCCATTATGACGGCTACCTGGCTAAAGACGGCTCGAAATTCTACTGCAGCCGGA 
CACAAAATGAAGGCCACCCCAAATGGTTTGTTCTTGGTGTTGGGCAAGTCATAAAAGGCCTA 
GACATTGCTATGACAGATATGTGCCCTGGAGAAAAGCGAAAAGTAGTTATACCCCCTTCATT 
TGCATACGGAAAGGAAGGCTATGCAGAAGGCAAGATTCCACCGGATGCTACATTGATTTTTG 
AGATTGAACTTTATGCTGTGACCAAAGGACCACGGAGCATTGAGACATTTAAACAAATAGAC 
ATGGACAATGACAGGCAGCTCTCTAAAGCCGAGATAAACCTCTACTTGCAAAGGGAATTTGA 
AAAAGATGAGAAGCCACGTGACAAGTCATATCAGGATGCAGTTTTAGAAGATATTTTTAAGA 
AGAATGACCATGATGGTGATGGCTTCATTTCTCCCAAGGAATACAATGTATACCAACACGAT 
GAACTATAGCATATTTGTATTTCTACTTTTTTTTTTTAGCTATTTACTGTACTTTATGTATA 
AAACAAAGTCACTTTTCTCCAAGTTGTATTTGCTATTTTTCCCCTATGAGAAGATATTTTGA 
TCTCCCCAATACATTGATTTTGGTATAATAAATGTGAGGCTGTTTTGCAAACTTAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FTGTIRE 104 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA65406 
<subunit 1 of 1, 222 aa, 1 stop 
<MW: 25794, pi: 6.24, NX(S/T): 1 

MPKTMHFLFRFIVFFYLWGLFTAQRQKKEESTEEVKIEVLHRPENCSKTSKKGDLLNAHYDG 
YLAKDGSKFYCSRTQNEGHPKWFVLGVGQVIKGLDIAMTDMCPGEKRKWIPPSFAYGKEGY 
AEGKI PPDATLI FE IELYAVTKGPRS IETFKQ IDMDNDRQLSKAE INLYLQREFEKDEKPRD 
KS YQDAVLED I FKKNDHDGDGF I S PKEYNVYQHDEL 

Important features: 

Endoplasmic reticulum targeting sequence. 

amino acids 219-222 

N-glycosylation site. 

amino acids 45-48 

FKBP-type peptidyl -prolyl cis- trans isomerase 
amino acids 87-223, 129-142 

EF-hand calcium-binding domain proteins 

amino acids 202-214, 195-214 
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FIGURE 105 

CAGAAATGCAGGGACCATTGCTTCTTCCAGGCCTCTGCTTTCTGCTGAGCCTCTTTGGAGCT 
GTGACTCAGAAAACCAAAACTTCCTGTGCTAAGTGCCCCCCAAATGCTTCCTGTGTCAATAA 
CACTCACTGCACCTGCAACCATGGATATACTTCTGGATCTGGGCAGAAACTATTCACATTCC 
CCTTGGAGACATGTAACGCCAGGCATGGTGGCTCGCGCCTGTAATCCCAGTTCTTTGGGAAG 
CCAAGGCAGGTGGATCACCTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATAGTGAAAC 
CCCGTGTCTACTAAAAATACAAAAATCAGCCGGGCGTGGTGGTGCATGCCTGCAATCCCAGT 
TACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACTCAGGAGGCAGAAGTTGCAGTGAACCC 
AGATCCTGCCATTGCACTCCAGCATGGATGACAGAGCAAGACTCCGTCTCAAAAAGAAAAGA 
TAGTTTCTTGTTTCATTTCGCGACTGCCCTCTCAGTGTTTCCTGGGATCCCCTCCCAAATAA 
AGTACTTATATTCTC 



,05/270 



WO 00/12708 



PCT/US99/20111 



FTGIJRE 106 

MQGPLLLPGLCFLLSLFGAVTQKTKTSCAKCPPNASCVNNTHCTCNHGYTSGSGQKLFTFPL 
ETCNARHGGSRL 

Signal peptide: 

amino acids 1-18 
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FIGURE 107 

CAAGCAGGTCATCCCCTTGGTGACCTTCAAAGAGAAGCAGAGAGGGCAGAGGTGGGGGGCAC 
AGGGAAAGGGTGACCTCTGAGATTCCCCTTTTCCCCCAGACTTTGGAAGTGACCCACCATGG 
GGCTCAGCATCTTTTTGCTCCTGTGTGTTCTTGGGCTCAGCCAGGCAGCCACACCGAAGATT 
TTCAATGGCACTGAGTGTGGGCGTAACTCACAGCCGTGGCAGGTGGGGCTGTTTGAGGGCAC 
CAGCCTGCGCTGCGGGGGTGTCCTTATTGACCACAGGTGGGTCCTCACAGCGGCTCACTGCA 
GCGGCAGCAGGTACTGGGTGCGCCTGGGGGAACACAGCCTCAGCCAGCTCGACTGGACCGAG 
CAGATCCGGCACAGCGGCTTCTCTGTGACCCATCCCGGCTACCTGGGAGCCTCGACGAGCCA 
CGAGCACGACCTCCGGCTGCTGCGGCTGCGCCTGCCCGTCCGCGTAACCAGCAGCGTTCAAC 
CCCTGCCCCTGCCCAATGACTGTGCAACCGCTGGCACCGAGTGCCACGTCTCAGGCTGGGGC 
ATCACCAACCACCCACGGAACCCATTCCCGGATCTGCTCCAGTGCCTCAACCTCTCCATCGT 
CTCCCATGCCACCTGCCATGGTGTGTATCCCGGGAGAATCACGAGCAACATGGTGTGTGCAG 
GCGGCGTCCCGGGGCAGGATGCCTGCCAGGGTGATTCTGGGGGCCCCCTGGTGTGTGGGGGA 
GTCCTTCAAGGTCTGGTGTCCTGGGGGTCTGTGGGGCCCTGTGGACAAGATGGCATCCCTGG 
AGTCTACACCTATATTTGCAAGTATGTGGACTGGATCCGGATGATCATGAGGAACAACTGAC 
CTGTTTCCTCCACCTCCACCCCCACCCCTTAACTTGGGTACCCCTCTGGCCCTCAGAGCACC 
AATATCTCCTCCATCACTTCCCCTAGCTCCACTCTTGTTGGCCTGGGAACTTCTTGGAACTT 
TAACTCCTGCCAGCCCTTCTAAGACCCACGAGCGGGGTGAGAGAAGTGTGCAATAGTCTGGA 
ATAAATATAAATGAAGGAGGGGCAAAAAAAAAAAAAA 
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FIGURE 108 

MGLSIFLLLCVLGLSQAATPKIFNGTECGRNSQPWQVGLFEGTSLRCGGVLIDHRWVLTAAH 
CSGSRYWVRLGEHSLSQLDWTEQIRHSGFSVTHPGYLGASTSHEHDLRLLRLRLPVRVTSSV 
QPLPLPNDCATAGTECHVSGWGITNHPRNPFPDLLQCLNLSIVSHATCHGVYPGRITSNMVC 
AGGVPGQD ACQGDSGGPLVCGG VLQGLVS WGS VG PCGQDG I PGVYT Y I CKYVDW I RM I MRNN 

Signal peptide: 

amino acids 1-17 
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FIGURE 109 

GCGGCCACACGCAGCTAGCCGGAGCCCGGACCAGGCGCCTGTGCCTCCTCCTCGTCCCTCGC 
CGCGTCCGCGAAGCCTGGAGCCGGCGGGAGCCCCGCGCTCGCCATGTCGGGCGAGCTCAGCA 
ACAGGTTCCAAGGAGGGAAGGCGTTCGGCTTGCTCAAAGCCCGGCAGGAGAGGAGGCTGGCC 
GAGATCAACCGGGAGTTTCTGTGTGACCAGAAGTACAGTGATGAAGAGAACCTTCCAGAAAA 
GCTCACAGCCTTCAAAGAGAAGTACATGGAGTTTGACCTGAACAATGAAGGCGAGATTGACC 
TGATGTCTTTAAAGAGGATGATGGAGAAGCTTGGTGTCCCCAAGACCCACCTGGAGATGAAG 
AAGATGATCTCAGAGGTGACAGGAGGGGTCAGTGACACTATATCCTACCGAGACTTTGTGAA 
CATGATGCTGGGGAAACGGTCGGCTGTCCTCAAGTTAGTCATGATGTTTGAAGGAAAAGCCA 
ACGAGAGCAGCCCCAAGCCAGTTGGCCCCCCTCCAGAGAGAGACATTGCTAGCCTGCCCTGA 
GGACCCCGCCTGGACTCCCCAGCCTTCCCACCCCATACCTCCCTCCCGATCTTGCTGCCCTT 
CTTGACACACTGTGATCTCTCTCTCTCTCATTTGTTTGGTCATTGAGGGTTTGTTTGTGTTT 
TCATCAATGTCTTTGTAAAGCACAAATTATCTGCCTTAAAGGGGCTCTGGGTCGGGGAATCC 
TGAGCCTTGGGTCCCCTCCCTCTCTTCTTCCCTCCTTCCCCGCTCCCTGTGCAGAAGGGCTG 
ATATCAAACCAAAAACTAGAGGGGGCAGGGCCAGGGCAGGGAGGCTTCCAGCCTGTGTTCCC 
CTCACTTGGAGGAACCAGCACTCTCCATCCTTTCAGAAAGTCTCCAAGCCAAGTfCAGGCTC 
ACTGACCTGGCTCTGACGAGGACCCCAGGCCACTCTGAGAAGACCTTGGAGTAGGGACAAGG 
CTGCAGGGCCTCTTTCGGGTTTCCTTGGACAGTGCCATGGTTCCAGTGCTCTGGTGTCACCC 
AGGACACAGCCACTCGGGGCCCCGCTGCCCCAGCTGATCCCCACTCATTCCACACCTCTTCT 
CATCCTCAGTGATGTGAAGGTGGGAAGGAAAGGAGCTTGGCATTGGGAGCCCTTCAAGAAGG 
TACCAGAAGGAACCCTCCAGTCCTGCTCTCTGGCCACACCTGTGCAGGCAGCTGAGAGGCAG 
CGTGCAGCCCTACTGTCCCTTACTGGGGCAGCAGAGGGCTTCGGAGGCAGAAGTGAGGCCTG 
GGGTTTGGGGGGAAAGGTCAGCTCAGTGCTGTTCCACCTTTTAGGGAGGATACTGAGGGGAC 
CAGGATGGGAGAATGAGGAGTAAAATGCTC ACGG CAAAGTCAGCAG CACTGGTAAGCCAAGA 
CTGAGAAATAGAAGGTTGCTTGTCTGACCCCAATCTGCTTGAAAAAAAAAAAAAAAAAA 
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FTGIJRE 110 

MSGELSNRFQGGKAFGLLKARQERRLAEINREFLCDQKYSDEENLPEKLTAFKEKYMEFDLN 
NEGE IDLMSLKRMMEKLGVPKTHLEMKKMI SE VTGGVSDT I S YRDFVNMMLGKRS AVLKLVM 
MFEGKANESSPKPVGPPPERDIASLP 
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FTOTTRE111A 

CGCGCTCCCCGCGCGCCTCCTCGGGCTCCACGCGTCTTGCCCCGCAGAGGCAGCCTCCTCCA 

GGAGCGGGGCCCTGCACACC ATG GCCCCCGGGTGGGCAGGGGTCGGCGCCGCCGTGCGCGCC 

CGCCTGGCGCTGGCCTTGGCGCTGGCGAGCGTCCTGAGTGGGCCTCCAGCCGTCGCCTGCCC 

CACCAAGTGTACCTGCTCCGCTGCCAGCGTGGACTGCCACGGGCTGGGCCTCCGCGCGGTTC 

CTCGGGGCATCCCCCGCAACGCTGAGCGCCTTGACCTGGACAGAAATAATATCACCAGGATC 

ACCAAGATGGACTTCGCTGGGCTCAAGAACCTCCGAGTCTTGCATCTGGAAGACAACCAGGT 

CAGCGTCATCGAGAGAGGCGCCTTCCAGGACCTGAAGCAGCTAGAGCGACTGCGCCTGAACA 

AGAATAAGCTGCAAGTCCTTCCAGAATTGCTTTTCCAGAGCACGCCGAAGCTCACCAGACTA 

GATTTGAGTGAAAACCAGATCCAGGGGATCCCGAGGAAGGCGTTCCGCGGCATCACCGATGT 

G AAG AACCTG CAACTGGACAACAACC ACAT CAGCTG CATTGAAGATGGAGCCTTCCGAG CGC 

TGCGCGATTTGGAGATCCTTACCCTCAACAACAACAACATCAGTCGCATCCTGGTCACCAGC 

TTCAACCACATGCCGAAGATCCGAACTCTGCGCCTCCACTCCAACCACCTCTACTGCGACTG 

CCACCTGGCCTGGCTCTCGGATTGGCTGCGACAGCGACGGACAGTTGGCCAGTTCACACTCT 

GCATGGCTCCTGTGCATTTGAGGGGCTTCAACGTGGCGGATGTGCAGAAGAAGGAGTACGTG 

TGCCCAGCCCCCCACTCGGAGCCCCCATCCTGCAATGCCAACTCCATCTCCTGCCCTTCGCC 

CTGCACGTGCAGCAATAACATCGTGGACTGTCGAGGAAAGGGCTTGATGGAGATTCCTGCCA 

ACTTGCCGGAGGGCATCGTCGAAATACGCCTAGAACAGAACTCCATCAAAGCCATCCCTGCA 

GGAGCCTTCACCCAGTACAAGAAACTGAAGCGAATAGACATCAGCAAGAATCAGATATCGGA 

TATTGCTCCAGATGCCTTCCAGGGCCTGAAATCACTCACATCGCTGGTCCTGTATGGGAACA 

AGATCACCGAGATTGCCAAGGGACTGTTTGATGGGCTGGTGTCCCTACAGCTGCTCCTCCTC 

AATGCCAACAAGATCAACTGCCTGCGGGTGAACACGTTTCAGGACCTGCAGAACCTCAACTT 

GCTCTCCCTGTATGACAACAAGCTGCAGACCATCAGCAAGGGGCTCTTCGCCCCTCTGCAGT 

CCATCCAGACACTCCACTTAGCCCAAAACCCATTTGTGTGCGACTGCCACTTGAAGTGGCTG 

GCCGACTACCTCCAGGACAACCCCATCGAGACAAGCGGGGCCCGCTGCAGCAGCCCGCGCCG 

ACTCGCCAACAAGCGCATCAGCCAGATCAAGAGCAAGAAGTTCCGCTGCTCAGGCTCCGAGG 

ATTACCGCAGCAGGTTCAGCAGCGAGTGCTTCATGGACCTCGTGTGCCCCGAGAAGTGTCGC 

TGTGAGGGCACGATTGTGGACTGCTCCAACCAGAAGCTGGTCCGCATCCCAAGCCACCTCCC 

TGAATATGTCACCGACCTGCGACTGAATGACAATGAGGTATCTGTTCTGGAGGCCACTGGCA 

TCTTCAAGAAGTTG CC CAACCTGCGGAAAATAAATCTGAGTAACAATAAGATCAAGGAGGTG 

CGAGAGGGAGCTTTCGATGGAGCAGCCAGCGTGCAGGAGCTGATGCTGACAGGGAACCAGCT 

GGAGACCGTGCACGGGCGCGTGTTCCGTGGCCTCAGTGGCCTCAAAACCTTGATGCTGAGGA 

GTAACTTGATCAGCTGTGTGAGTAATGACACCTTTGCCGGCCTGAGTTCGGTGAGACTGCTG 

TCCCTCTATGACAATCGGATCACCACCATCACCCCTGGGGCCTTCACCACGCTTGTCTCCCT 

GTCCACCATAAACCTCCTGTCCAACCCCTTCAACTGCAACTGCCACCTGGCCTGGCTCGGCA 

AGTGGTTGAGGAAGAGGCGGATCGTCAGTGGGAACCCTAGGTGCCAGAAGCCATTTTTCCTC 

AAGGAGATTCCCATCCAGGATGTGGCCATCCAGGACTTCACCTGTGATGGCAACGAGGAGAG 

TAGCTGCCAGCTGAGCCCGCGCTGCCCGGAGCAGTGCACCTGTATGGAGACAGTGGTGCGAT 

GCAGCAACAAGGGGCTCCGCGCCCTCCCCAGAGGCATGCCCAAGGATGTGACCGAGCTGTAC 

CTGGAAGGAAACCACCTAACAGCCGTGCCCAGAGAGCTGTCCGCCCTCCGACACCTGACGCT 

TATTGACCTGAGCAACAACAGCATCAGCATGCTGACCAATTACACCTTCAGTAACATGTCTC 

ACCTCTCCACTCTGATCCTGAGCTACAACCGGCTGAGGTGCATCCCCGTCCACGCCTTCAAC 

GGGCTGCGGTCCCTGCGAGTGCTAACCCTCCATGGCAATGACATTTCCAGCGTTCCTGAAGG 

CTCCTTCAACGACCTCACATCTCTTTCCCATCTGGCGCTGGGAACCAACCCACTCCACTGTG 

ACTGCAGTCTTCGGTGGCTGTCGGAGTGGGTGAAGGCGGGGTACAAGGAGCCTGGCATCGCC 

CGCTGCAGTAGCCCTGAGCCCATGGCTGACAGGCTCCTGCTCACCACCCCAACCCACCGCTT 

CCAGTGCAAAGGGCCAGTGGACATCAACATTGTGGCCAAATGCAATGCCTGCCTCTCCAGCC 

CGTGCAAGAATAACGGGACATGCACCCAGGACCCTGTGGAGCTGTACCGCTGTGCCTGCCCC 
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FTGTJRE 111B 



TACAGCTACAAGGGCAAGGACTGCACTGTGCCCATCAACACCTGCATCCAGAACCCCTGTCA 
GCATGGAGGCACCTGCCACCTGAGTGACAGCCACAAGGATGGGTTCAGCTGCTCCTGCCCTC 
TGGGCTTTGAGGGGCAGCGGTGTGAGATCAACCCAGATGACTGTGAGGACAACGACTGCGAA 
AACAATGCCACCTGCGTGGACGGGATCAACAACTACGTGTGTATCTGTCCGCCTAACTACAC 
AGGTGAGCTATGCGACGAGGTGATTGACCACTGTGTGCCTGAGCTGAACCTCTGTCAGCATG 
AGGCCAAGTGCATCCCCCTGGACAAAGGATTCAGCTGCGAGTGTGTCCCTGGCTACAGCGGG 
AAGCTCTGTGAGACAGACAATGATGACTGTGTGGCCCACAAGTGCCGCCACGGGGCCCAGTG 
CGTGGACACAATCAATGGCTACACATGCACCTGCCCCCAGGGCTTCAGTGGACCCTTCTGTG 
AACACCCCCCACCCATGGTCCTACTGCAGACCAGCCCATGCGACCAGTACGAGTGCCAGAAC 
GGGGCCCAGTGCATCGTGGTGCAGCAGGAGCCCACCTGCCGCTGCCCACCAGGCTTCGCCGG 
CCCCAGATGCGAGAAGCTCATCACTGTCAACTTCGTGGGCAAAGACTCCTACGTGGAACTGG 
CCTCCGCCAAGGTCCGACCCCAGGCCAACATCTCCCTGCAGGTGGCCACTGACAAGGACAAC 
GGCATCCTTCTCTACAAAGGAGACAATGACCCCCTGGCACTGGAGCTGTACCAGGGCCACGT 
GCGGCTGGTCTATGACAGCCTGAGTTCCCCTCCAACCACAGTGTACAGTGTGGAGACAGTGA 
ATGATGGGCAGTTTCACAGTGTGGAGCTGGTGACGCTAAACCAGACCCTGAACCtAGTAGTG 
GACAAAGGAACTCCAAAGAGCCTGGGGAAGCTCCAGAAGCAGCCAGCAGTGGGCATCAACAG 
CCCCCTCTACCTTGGAGGCATCCCCACCTCCACCGGCCTCTCCGCCTTGCGCCAGGGCACGG 
ACCGGCCTCTAGGCGGCTTCCACGGATGCATCCATGAGGTGCGCATCAACAACGAGCTGCAG 
GACTTCAAGGCCCTCCCACCACAGTCCCTGGGGGTGTCACCAGGCTGCAAGTCCTGCACCGT 
GTGCAAGCACGGCCTGTGCCGCTCCGTGGAGAAGGACAGCGTGGTGTGCGAGTGCCGCCCAG 
GCTGGACCGGCCCACTCTGCGACCAGGAGGCCCGGGACCCCTGCCTCGGCCACAGATGCCAC 
CATGGAAAATGTGTGGCAACTGGGACCTCATACATGTGCAAGTGTGCCGAGGGCTATGGAGG 
GGACTTGTGTGACAACAAGAATGACTCTGCCAATGCCTGCTCAGCCTTCAAGTGTCACCATG 
GGCAGTGCCACATCTCAGACCAAGGGGAGCCCTACTGCCTGTGCCAGCCCGGCTTTAGCGGC 
GAGCACTGCCAACAAGAGAATCCGTGCCTGGGACAAGTAGTCCGAGAGGTGATCCGCCGCCA 
GAAAGGTTATGCATCATGTGCCACAGCCTCCAAGGTGCCCATCATGGAATGTCGTGGGGGCT 
GTGGGCCCCAGTGCTGCCAGCCCACCCGCAGCAAGCGGCGGAAATACGTCTTCCAGTGCACG 
GACGGCTCCTCGTTTGTAGAAGAGGTGGAGAGACACTTAGAGTGCGGCTGCCTCGCGTGTTC 
CTAAGCCCCTGCCCGCCTGCCTGCCACCTCTCGGACTCCAGCTTGATGGAGTTGGGACAGCC 
ATGTGGGACC CCCTGGTGATTCAG CATGAAGGAAATGAAGCTGGAGAGGAAGGTAAAGAAGA 
AGAGAATATTAAGTATATTGTAAAATAAACAAAAAATAGAACTTAAAAAAAAAAAAAAAAAA 
AAAAAA 
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FIGURE 112 

MAPGWAGVGAAVRARLALALALAS VLSGPPAVAC PTKCTCSAASVDCHGLGLRAVPRG I PRN 
AERLDLDRNNITRITKMDFAGLKNLRVLHLEDNQVSVIERGAFQDLKQLERLRLNKNKLQVL 
PELLFQS TPKLTRLDLS ENQ IQG I PRKAFRG I TD VKNLQLDNNH I S CI EDGAFRALRDLE I L 
TLNNNNI SRI LVTS FNHMPKI RTLRLHSNHLYCDCHLAWLSDWLRQRRTVGQFTLCMAPVHL 
RGFNVADVQKKEYVCPAPHSEPPSCNANSISCPSPCTCSNNIVDCRGKGLMEIPANLiPEGIV 
E I RLEQNS I KAI PAG AFTQ Y KKLKR I D I S KNQ I S D I APDAFQGLKS LTSLVL YGNK I TE I AK 
GLFDGLVSLQLLLLNANKINCLRWTFQDLQNLNLLSLYDNKLQTI SKGLFAPLQS I QTLHL 
AQNPFVCDCHLKWLADYLQDNPIETSGARCSSPRRLANKRISQIKSKKFRCSGSEDYRSRFS 
SECFMDLVCPEKCRCEGTIVDCSNQKLVRIPSHLPEYVTDLRLNDNEVSVLEATGIFKKLPN 
LRKINLSNNKIKEVREGAFDGAASVQELMLTGNQLETVHGRVFRGLSGLKTLML?SNLISCV 
SNDT FAGLS S VRLLSLYDNR I TT I TPGAFTTL VSLS T I NLLS NPFNCNCHLAWLGKWLRKRR 
IVSGNPRCQKPFFLKEIPIQDVAIQDFTCDGNEESSCQLSPRCPEQCTCMETWRCSNKGLR 
ALPRGMPKDVTELYLEGNHLTAVPRELSALRHLTLIDLSNNSISMLTNYTFSNMSHLSTLIL 
S YNRLRC I PVHAFNGLRSLRVLTLHGND I S S VPEGSFNDLTS LSHLALGTNPLHCDCSLRWL 
SEWVKAGYKEPGIARCSSPEPMADRLLLTTPTHRFQCKGPVDINIVAKCNACLSSPCKNNGT 
CTQDPVELYRCACPYSYKGKDCTVPINTCIQNPCQHGGTCHLSDSHKDGFSCSCPLGFEGQR 
CE INPDD CEDNDCENNAT CVDG INNYVC I C P PNYTGELCDE VI DHCVPELNLCQHEAKC I PL 
DKGFSCECVPGYSGKLCETDNDDCVAHKCRHGAQCVDTINGYTCTCPQGFSGPFCEHPPPMV 
LLQTSPCDQYECQNGAQCIWQQEPTCRCPPGFAGPRCEKLITVNFVGKDSYVELASAKVRP 
QANISLQVATDKDNGILLYKGDNDPLALELYQGHVRLVYDSLSSPPTTVYSVETVNDGQFHS 
VELVTLNQTLNLWDKGTPKSLGKLQKQPAVGINSPLYLGGIPTSTGLSALRQGTDRPLGGF 
HGCIHEVRINNELQDFKALPPQSIiGVSPGCKSCTVCKHGLCRSVEKDSWCECRPGWTGPLC 
DQEARDPCLGHRCHHGKCVATGTSYMCKCAEGYGGDLCDNKNDSANACSAFKCHHGQCHISD 
QGEPYCLCQPGFSGEHCQQENPCLGQWREVIRRQKGYASCATASKVPIMECRGGCGPQCCQ 
PTRSKRRKYVFQCTDGSSFVEEVERHLECGCLACS 

Signal peptide: 

amino acids 1-27 
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FIGURE 113 

GGATGCAGGACGCTCCCCTGAGCTGCCTGTCACCGACTAGGTGGAGCAGTGTTTCTTCCGCA 
GACTCAACTGAGAAGTCAGCCTCTGGGGCAGGCACCAGGAATCTGCCTTTTCAGTTCTGTCT 
CCGGCAGGCTTTGAGGATGAAGGCTGCGGGCATTCTGACCCTCATTGGCTGCCTGGTCACAG 
GCGCCGAGTCCAAAATCTACACTCGTTGCAAACTGGCAAAAATATTCTCGAGGGCTGGCCTG 
GACAATTACTGGGGCTTCAG CCTTGGAAACTGGATCTG CATGGCAT ATTATGAGAGCGGCTA 
CAACACCACAGCCC CG ACGGTC CTGGATGACGGCAGCATCGACTATGG CATCTTCCAGATCA 
ACAGCTTCGCGTGGTGCAGACGCGGAAAGCTGAAGGAGAACAACCACTGCCATGTCGCCTGC 
TCAGCCTTGATCACTGATGACCTCACAGATGCAATTATCTGTGCCAGGAAAATTGTTAAAGA 
GACACAAGGAATGAACTATTGGCAAGGCTGGAAGAAACATTGTGAGGGCAGAGACCTGTCCG 
AGTGGAAAAAAGGCTGTGAGGTTTCCTAAACTGGAACTGGACCCAGGATGCTTTGCAGCAAC 
GCCCTAGGATTTGCAGTGAATGTCCAAATGCCTGTGTCATCTTGTCCCGTTTCCTCCCAATA 
TTCCTTCTCAAACTTGGAGAGGGAAAATTAAGCTATACTTTTAAGAAAATAAATATTTCCAT 
TTAAATGTC 
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FIGURE 114 

MKAAGILTLIGCLVTGAESKI YTRCKLAKI FSRAGLDNYWGFSLGNWI CMAYYESGYNTTAP 
TVLDDGSIDYGI FQINSFAWCRRGKLKENNHCHVACSALI TDDLTDAI I CARKI VKETQGMN 
YWQGWKKHCEGRDLSEWKKGCEVS 

Signal peptide: 

amino acids 1-19 
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FTGTTRE 115 

CAGGCCATTTGCATCCCACTGTCCTTGTGTTCGGAGCCAGGCCACACCGTCCTCAGCAGTGT 

CATGTGTTAAAAACGCCAAGCTGAATATATCATGCCCCTATTAAAACTTGTACATGGCTCCC 

CATTGGTTTTTGGAGAAAAGTTCAAGCTTTTTACCTTGGTGTCTGCCTGTATCCCAGTGTTC 

AGGCTGGCTAGACGGCGGAAGAAGATCCTATTTTACTGTCACTTCCCAGATCTGCTTCTCAC 

CAAGAGAGATTCTTTTCTTAAACGACTATACAGGGCCCCAATTGACTGGATAGAGGAATACA 

CCACAGGCATGGCAGACTGCATCTTAGTCAACAGCCAGTTCACAGCTGCTGTTTTTAAGGAA 

ACATTCAAGTCCCTGTCTCACATAGACCCTGATGTCCTCTATCCATCTCTAAATGTCACCAG 

CTTTGACTCAGTTGTTCCTGAAAAGCTGGATGACCTAGTCCCCAAGGGGAAAAAATTCCTGC 

TGCTCTCCATCAACAGATACGAAAGGAAGAAAAATCTGACTTTGGCACTGGAAGCCCTAGTA 

CAGCTGCGTGGAAGATTGACATCCCAAGATTGGGAGAGGGTTCATCTGATCGTGGCAGGTGG 

TTATGACGAGAGAGTCCTGGAGAATGTGGAACATTATCAGGAATTGAAGAAAATGGTCCAAC 

AGTCCGACCTTGGCCAGTATGTGACCTTCTTGAGGTCTTTCTCAGACAAACAGAAAATCTCC 

CTCCTCCACAGCTGCACGTGTGTGCTTTACACACCAAGCAATGAGCACTTTGGCATTGTCCC 

TCTGGAAGCCATGTACATGCAGTGCCCAGTCATTGCTGTTAATTCGGGTGGACCCTTGGAGT 

CCATTGACCACAGTGTCACAGGGTTTCTGTGTGAGCCTGACCCGGTGCACTTCTCAGAAGCA 

ATAGAAAAGTTCATCCGTGAACCTTCCTTAAAAGCCACCATGGGCCTGGCTGGAAGAGCCAG 

AGTGAAGGAAAAATTTTCCCCTGAAGCATTTACAGAACAGCTCTACCGATATGTTACCAAAC 

TGCTGGTATAATCAGATTGTTTTTAAGATCTCCATTAATGTCATTTTTATGGATTGTAGACC 

CAGTTTTGAAACCAAAAAAGAAACCTAGAATCTAATGCAGAAGAGATCTTTTAAAAAATAAA 

CTTGAGTCTTGAATGTGAGCCACTTTCCTATATACCACACCTCCCTGTCCACTTTTCAGAAA 

AACCATGTCTTTTATGCTATAATCATTCCAAATTTTGCCAGTGTTAAGTTACAAATGTGGTG 

TCATT C CATGTTCAGCAGAGTATTTTAATTATATTTTCTCGGGATTATTGCT CTTCTGTCTA 

TAAATTTTGAATGATACTGTGCCTTAATTGGTTTTCATAGTTTAAGTGTGTATCATTATCAA 

AGTTGATTAATTTGGCTTCATAGTATAATGAGAGCAGGGCTATTGTAGTTCCCAGATTCAAT 

CCACCGAAGTGTTCACTGTCATCTGTTAGGGAATTTTTGTTTGTCCTGTCTTTGCCTGGATC 

CATAGCGAGAGTGCTCTGTATTTTTTTTAAGATAATTTGTATTTTTGCACACTGAGATATAA 

TAAAAGGTGTTTATCATAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 

MPLLKLVHGSPLVFGEKFKLFTLVSACIPVFRLARRRKKILFYCHFPDLLLTKRDSFLKRLY 
RAPIDWIEEYTTGMADCILVNSQFTAAVFKETFKSLSHIDPDVLYPSLNVTSFDSVVPEKLD 
DLVPKGKKFLLLSINRYERKK^LTLALEALVQLRGRLTSQDWERVHLIVAGGYDERVLENVE 
HYQELKKMVQQSDLGQYVTFLRSFSDKQKISLLHSCTCVLYTPSNEHFGIVPLEAMYMQCPV 

IAVNSGGPLESIDHSVTGFLCEPDPVHFSEAIEKFIREPSLKATMGLAGRARVKEKFSPEAF 
TEQLYRYVTKLLV 

Signal peptide: 

amino acids 1-15 
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FIGURE 117 

GACT ACGC CGAT CCGAGACGTGGCTCCCTGGGCGGC AG AACCATGTTGG ACTT CG CG AT CTT 
CGCCGTTACCTTCTTGCTGGCGTTGGTGGGAGCCGTGCTCTACCTCTATCCGGCTTCCAGAC 
AAGCTGCAGGAATTCCAGGGATTACTC CAACTGAAG AAAAAGATGGTAAT CTTC CAGATATT 
GTGAATAGTGGAAGTTTGCATGAGTTCCTGGTTAATTTGCATGAGAGATATGGGCCTGTGGT 
CTCCTTCTGGTTTGGCAGGCGCCTCGTGGTTAGTTTGGGCACTGTTGATGTACTGAAGCAGC 
ATATCAATCCCAATAAGACATCGGACCCTTTTGAAACCATGCTGAAGTCATTATTAAGGTAT 
CAATCTGGTGGTGGCAGTGTGAGTGAAAACCACATGAGGAAAAAATTGTATGAAAATGGTGT 
GACTGATTCTCTGAAGAGTAACTTTGCCCTCCTCCTAAAGCTTTCAGAAGAATTATTAGATA 
AATGGCTCTCCTACCCAGAGACCCAGCACGTGCCCCTCAGCCAGCATATGCTTGGTTTTGCT 
ATGAAGTCTGTTACACAGATGGTAATGGGTAGTACATTTGAAGATGATCAGGAASTCATTCG 
CTTC CAG AAGAAT C ATGG C AC AGTTTGGTCTGAG ATTGG AAAAGGCTTTCTAG ATGGGTC AC 
TTGATAAAAACATGACTCGGAAAAAACAATATGAAGATGCCCTCATGCAACTGGAGTCTGTT 
TTAAGGAACATCATAAAAGAACGAAAAGGAAGGAACTTCAGTCAACATATTTTCATTGACTC 
CTTAGTACAAGGGAACCTTAATGACCAACAGATCCTAGAAGACAGTATGATATTTTCTCTGG 
CCAGTTGCATAATAACTGCAAAATTGTGTACCTGGGCAATCTGTTTTTTAACCACCTCTGAA 
GAAGTTCAAAAAAAATTATATGAAGAGATAAACCAAGTTTTTGGAAATGGTCCTGTTACTCC 
AGAGAAAATTGAGCAGCTCAGATATTGTCAGCATGTGCTTTGTGAAACTGTTCGAACTGCCA 
AACTGACTCCAGTTTCTGCCCAGCTTCAAGATATTGAAGGAAAAATTGACCGATTTATTATT 
CCTAGAGAGACCCTCGTCCTTTATGCCCTTGGTGTGGTACTTCAGGATCCTAATACTTGGCC 
ATCTCCACACAAGTTTGATCCAGATCGGTTTGATGATGAATTAGTAATGAAAACTTTTTCCT 
CACTTGGATTCTCAGGCACACAGGAGTGTCCAGAGTTGAGGTTTGCATATATGGTGACCACA 
GTACTTCTTAGTGTATTGGTGAAGAGACTGCACCTACTTTCTGTGGAGGGACAGGTTATTGA 
AACAAAGTATGAACTGGTAACATCATCAAGGGAAGAAGCTTGGATCACTGTCTCAAAGAGAT 
ATTAAAATTTTATACATTTAAAATCATTGTTAAATTGATTGAGGAAAACAACCATTTAAAAA 
AAATCTATGTTGAATCCTTTTATAAACCAGTATCACTTTGTAATATAAACACCTATTTGTAC 
TTAA 



Mr/*?* 



WO 00/12708 



PCT/US99/20111 



FIGURE 118 

MLDFAIFAVTFLLALVGAVLYLYPASRQAAGIPGITPTEEKDGNLPDIVNSGSLHEFLVNLH 
ERYGPWSFWFGRRLWSLGTVDVLKQHINPNKTSDPFETMLKSLLRYQSGGGSVSENHMRK 
KLYENGVTDSLKSNFALLLKLSEELLDKV^SYPETQHVPLSQHMLGFAMKSVTQMVMGSTFE 
DDQEVIRFQKNHGTVWSE IGKGFLDGSLDKNMTRKKQYEDALMQLESVLRNI I KERKGRNFS 
QHIFIDSLVQGNLNDQQILEDSMIFSLASCIITAKLCTWAICFLTTSEEVQKKLYEEINQVF 
GNGPVTPEKIEQLRYCQHVLCETVRTAKLTPVSAQLQDIEGKIDRFXIPRETLVLYALGWL 
QDPNTWPSPHKFDPDRFDDELVMKTFSSLGFSGTQECPELRFAYKVTTVLLSVLVKRLHLLS 
VEGQVIETKYELVTSSREEAWITVSKRY 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 
amino acids 271-2 90 



ill/no 
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FTOTTRE 119 

CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGA 
GACCGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTC 
CTGACGCTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACAT 
ACAGGCCTGCCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGG 
TGGCCGCGCTCTCTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGA 
GTCTCCATGTTCAACAGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGT 
GGCCCTGTCCTTCTTCATATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCT 
TCTGCAGTGCCCTTCCAGCTGTCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAA 
AAGAAACCCTTCTGATTACCTTCATGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGG 
CCGCTTCGTATTCCTGGAAGAAGGAAGGCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTC 
TGCTGGAGGATATGTGTTGGAATAATTACGTCTTGAGTCTGGGATTATCCGCATTGTATTTA 
GTGCTTTGTAATAAAATATGTTTTGTAGTAACATTAAGACTTATATACAGTTTTAGGGGACA 

ATTAAAAAAAAAAAA 
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FTQITRE 120 

MGRVSGLVPSRFLTLLAHLVWITLFVJSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLG 
LFAVELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVT 
EMALFVTVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE 121 

TCCCGGACCCTGCCGCCCTGCCACTATGTCCCGCCGCTCTATGCTGCTTGCCTGGGCTCTCC 
CCAGCCTCCTTCGACTCGGAGCGGCTCAGGAGACAGAAGACCCGGCCTGCTGCAGCCCCATA 
GTGCCCCGGAACGAGTGGAAGGCCCTGGCATCAGAGTGCGCCCAGCACCTGAGCCTGCCCTT 
ACGCTATGTGGTGGTATCGCACACGGCGGGCAGCAGCTGCAACACCCCCGCCTCGTGCCAGC 
AGCAGGCCCGGAATGTGCAGCACTACCACATGAAGACACTGGGCTGGTGCGACGTGGGCTAC 
AACTTCCTGATTGGAGAAGACGGGCTCGTATACGAGGGCCGTGGCTGGAACTTCACGGGTGC 
CCACTCAGGTCACTTATGGAACCCCATGTCCATTGGCATCAGCTTCATGGGCAACTACATGG 
ATCGGGTGCCCACACCCCAGGCCATCCGGGCAGCCCAGGGTCTACTGGCCTGCGGTGTGGCT 
CAGGGAGCCCTGAGGTCCAACTATGTGCTCAAAGGACACCGGGATGTGCAGCGTACACTCTC 
TCCAGGCAACCAGCTCTACCACCTCATCCAGAATTGGCCACACTACCGCTCCCCCTGAGGCC 
CTGCTGATCCGCACCCCATTCCTCCCCTCCCATGGCCAAAAACCCCACTGTCTCCTTCTCCA 
ATAAAGATGTAGCTC 
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FIGURE 122 

MSRRSMLLAWALPSLLRLGAAQETEDPACCSPIVPRNEWKALASECAQHLSLPLRYWVSHT 
AGSSCNTPASCQQQARNVQHYHMKTLGWCDVGYNFLIGEDGLVYEGRGWNFTGAHSGHLWNP 
MS IGISFMGNYMDRVPTPQAIRAAQGLLACGVAQGALRSNYVLKGHRDVQRTLS PGNQLYHL 
IQNWPHYRSP 

Signal peptide: 

amino acids 1-20 
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FIGURE 123 

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATGCAACT 
GACTCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGG 
ATGATGGTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCC 
CGGGTGCCTCGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCT 
CCTAGGGCTGCTGGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCC 
CGAACCACAGCCCCCCACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTAC 
TCCAACATCAAGACGGTGGCCCTGAAGCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAA 
TGGGACCTTCAGCGTCCACTTCCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCC 
TCGTGCCCCCCAGTAAAGCTGTAGAGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAG 
GCCTCCAAAATCTTCAACTGCCGGATGGAGTGGGAGAAGGTAGAACGGGGCCGCQGGACCTC 
GCTTTGCACCCACGACCCAGCCAAGATCTGCTCCCGAGACCACGCTCAGAGCTCAGCCACCT 
GGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTGTCTACATCGCCTTCTACAGCACGGACTAT 
CGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTACCATAGTGATACCCCCTACTACCCATC 
TGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGCTGGAAGGACAGGCCTGCCCATGC 
AGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTCAGGCAGGGAGGGGGGTGGAG 
ACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAGAGAAAGGGTCCCAAGTG 
CTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGGAGGAGGAGTGGGCT 
CTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGGTCCCCGAGGCC 
TGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGCCCTTGGTT 
CTTG CC AT CCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACAGCCTG 
TCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGGGC 
CAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATG 
GCATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTC 
CAGCCAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATT 
GCTGATGGCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCC 
ACAGCCCATCCGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAG 
CATCCATGTCCCGGAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCC 
CGGATCTGGATGGCGCCGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGA 
GCATGTGCTGGATCTGTTCTGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGT 
GAAACCGCTGATTGCTGACTTTTGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTT 
GCCCCGGGGCA 
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FIGURE 124 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA66521 
xsubunit 1 of 1, 252 aa, 1 stop 
><MW: 28127, pi; 8.91, NX(S/T): 5 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMAN 

STLLGLI^PPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVD 

HGNGTF S VHF QHNATGQGN ISIS L VP P S KAVE FHQEQQ I F I EAKAS KI FNCRMEWEKVERGR 

RTSLCTHDPAKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPY 
YPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyltransf erase 
amino acids 61-71 
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FIGURE 125 

GTGAATGTGAGGGTTTGATGACTTTCAGATGTCTAGGAACCAGAGTGGGTGCAGGGGCCCCA 
GGCAGGGCTGATTCTTGGGCGGAGGAGAGTAGGGTAAAGGGTTCTGCATGAGCTCCTTAAAG 
GACAAAGGTAACAGAGCCAGCGAGAGAGCTCGAGGGGAGACTTTGACTTCAAGCCACAGAAT 
TGGTGGAAGTGTGCGCGCCGCCGCCGCCGTCGCTCCTGCAGCGCTGTCGACCTAGCCGCTAG 
CATCTTCCCGAGCACCGGGATCCCGGGGTAGGAGGCGACGCGGGCGAGCACCAGCGCCAGCC 
GGCTGCGGCTGCCCACACGGCTCACCATGGGCTCCGGGCGCCGGGCGCTGTCCGCGGTGCCG 
GCCGTGCTGCTGGTCCTCACGCTGCCGGGGCTGCCCGTCTGGGCACAGAACGACACGGAGCC 
CATCGTGCTGGAGGGCAAGTGTCTGGTGGTGTGCGACTCGAACCCGGCCACGGACTCCAAGG 
GCTCCTCTTCCTCCCCGCTGGGGATATCGGTCCGGGCGGCCAACTCCAAGGTCGCCTTCTCG 
GCGGTGCGGAGCACCAACCACGAGCCATCCGAGATGAGCAACAAGACGCGCATCATTTACTT 
CGATCAGATCCTGGTGAATGTGGGTAATTTTTTCACATTGGAGTCTGTCTTTGTAGCACCAA 
GAAAAGGAATTTACAGTTTCAGTTTTCACGTGATTAAAGTCTACCAGAGCCAAACTATCCAG 
GTTAACTTGATGTTAAATGGAAAACCAGTAATATCTGCCTTTGCGGGGGACAAAGATGTTAC 
TCGTGAAGCTGCCACGAATGGTGTCCTGCTCTACCTAGATAAAGAGGATAAGGTTTACCTAA 
AACTGGAGAAAGGTAATTTGGTTGGAGGCTGGCAGTATTCCACGTTTTCTGGCTTTCTGGTG 
TTCCCCCTATAGGATTCAATTTCTCCATGATGTTCATCCAGGTGAGGGATGACCCACTCCTG 
AGTTATTGGAAGATCATTTTTTCATCATTGGATTGATGTCTTTTATTGGTTTCTCATGGGTG 
GATATGGATTCTAAGGATTCTAGCCTGTCTGAACCAATACAAAATTTCACAGATTATTTGTG 
TGTGTCTGTTTCAGTATATTTGGATTGGGACTCTAAGCAGATAATACCTATGCTTAAATGTA 
ACAGTCAAAAGCTGTCTGCAAGACTTATTCTGAATTTCATTTCCTGGGATTACTGAATTAGT 
TACAGATGTGGAATTTTATTTGTTTAGTTTTAAAAGACTGGCAACCAGGTCTAAGGATTAGA 
AAACTCTAAAGTTCTGACTTCAATCAACGGTTAGTGTGATACTGCCAAAGAACTGTATACTG 
TGTTAATATATTGATTATATTTGTTTTTATTCCTTTGGAATTAGTTTGTTTGGTTCTTGTAA 
AAAACTTGGATTTTTTTTTTCAGTAACTGGTATTATGTTTTCTCTTAAAATAAGGTAATGAA 
TGGCTTGCCCACAAATTTACCTTGACTACGATATCATCGACATGACTTCTCTCAAAAAAAAA 
GAATGCTTCATAGTTGTATTTTAATTGTATATGTGAAAGAGTCATATTTTCCAAGTTATATT 
TTCTAAGAAGAAGAATAGATCATAAATCTGACAAGGAAAAAGTTGCTTACCCAAAATCTAAG 
TGCTCAATCCCTGAGCCTCAGCAAAACAGCTCCCCTCCGAGGGAAATCTTATACTTTATTGC 
TCAACTTTAATTAAAATGATTGATAATAACCACTTTATTAAAAACCTAAGGTTTTTTTTTTT 
TCCGTAGACATGACCACTTTATTAACTGGTGGTGGGATGCTGTTGTTTCTAATTATACCTAT 
TTTTCAAGGCTTCTGTTGTATTTGAAGTATCATCTGGTTTTGCCTTAACTCTTTAAATTGTA 
TATATTTATCTGTTTAGCTAATATTAAATTCAAATATCCCATATCTAAATTTAGTGCAATAT 
CTTGTCTTTTGTATAGGTCATATGAATTCATAAAATTATTTATGTCTGTTATAGAATAAAGA 
TTAATATATGTTAAAAAAA 
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FIGURE 126 

MGSGRRALSAVPAVLLVLTLPGLPVWAQNDTEPIVLEGKCLWCDSNPATDSKGSSSSPLGI 

SVRAANSKVAFSAVRSTNHEPSEMSNKTRIIYFDQILVNVGNFFTLESVFVAPRKGIYSFSF 

HVIKVYQSQTIQVNLMLNGKPVISAFAGDKDVTREAATNGVLLYLDKEDKVYLKLEKGNLVG 
GWQYSTFSGFLVFPL 

Signal peptide: 

amino acids 1-27 
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FIGURE 127 

CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTC 
GCCCTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGC 
TTTCTTCTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTA 
TTGACAACAAAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTC 
TATATCCAAGAAATGTTCCGATTTGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGTTT 
GAAGAGTATAAACCCAGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCT 
TGGGCTTTGGAATCATGAGTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGG 
CCAGGCACAGTGGGCATTCATGGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGAC 
GCTGGTCATTATCTTGCTGCATGTATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGA 
AAAAGTGGGGCATCCTCCTTATCGTTCTCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTC 
ATAAGTTCTTATTATGGAATAAACCTGGCGTCAGCATTTATAATCCTGGTGCTCATGGGCAC 
CTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCGAAGCCTGAAACTCTGCCTGCTCTGCCAAG 
ACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGATAACCTCAGGGAACCAGCACTTCCCAA 
ACCGCAGACTACATCTTTAGAGGAAGCACAACTGTGCCTTTTTCTGAAAATCCCTTTTTCTG 
GTGGAATTGAGAAAGAAATAAAACTATGCAGATA 
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FIGURE 128 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss ,DNA66658 
xsubunit 1 of 1, 257 aa, 1 stop 
><MW: 28472, pi: 9.33, NX(S/T): 0 

MTAAVFFGCAFIAFGPALALYVFTIAIEPLRIIFLIAGAFFWLVSLLISSLVWFMARVIIDN 
KDGPTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGF 
GIMSGVFSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKW 
GILL I VLLTHLLVS AQTF I SSYYG INLASAF 1 1 LVLMGTWAFLAAGGS CRSLKLCLLCQDKN 
FLLYNQRSR 

Important features of the protein: 
Signal peptide: 
amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 129 

CGGCAACCAGCCGCCGCCRCCA.CCGCTGCCACTGCCGCCCTGCCGGGGC CATG TTCGCTCTGGGCTTGCCCTTCT 

TGGTGCTCTTGGTGGCCTCGGTCGAGAGCCATCTGGGGGTTCTGGGGCCCAAGAACGTCTCGCAGAAAGACGCCG 

AGTTTGAGCGCACCTACGTGGACGAGGTCAACAGCGAGCTGGTCAACATCTACACCTTCAACCATACTGTGACCC 

GCAACAGGACAGAGGGCGTGCGTGTGTCTGTGAACGTCCTGAACAAGCAGAAGGGGGCGCCGTTGCTGTTTGTGG 

TCCGCCAGAAGGAGGCTGTGGTGTCCTTCCAGGTGCCCCTAATCCTGCGAGGGATGTTTCAGCGCAAGTACCTCT 

ACCAAAAAGTGGAACGAACCCTGTGTCAGCCCCCCACCAAGAATGAGTCGGAGATTCAGTTCTTCTACGTGGATG 

TGTCCACCCTGTCACCAGTCAACACCACATACCAGCTCCGGGTCAGCCGCATGGACGATTTTGTGCTCAGGACTG 

GGGAGCAGTTCAGCTTGAATAC C ACAGC AGCACAGCC CCAGT ACTTCAAGTATGAGTTCCCTGAAGG CGTGGACT 

CGGTAATTGTCAAGGTGACCTCCAACAAGGCCTTCCCCTGCTCAGTCATCTCCATTCAGGATGTGCTGTGTCCTG 

TCTATGACCTGGACAACAACGTAGCCTTCATCGGCATGTACCAGACGATGACCAAGAAGGCGGCCATCACCGTAC 

AGCGCAAAGACTTCCCCAGCAACAGCTTTTATGTGGTGGTGGTGGTGAAGACCGAAGACCAAGCCTGCGGGGGCT 

CCCTGCCTTTCTACCCCTTCGCAGAAGATGAACCGGTCGATCAAGGGCACCGCCAGAAAACCCTGTCAGTGCTGG 

TGTCTC^^GCAGTCACGTCTGAGGCATACGTCAGTGGGATGCTCTTTTGCCTGGGTATATTTCTCTCCTTTTACC 

TGCTGACCGTCCTCCTGGCCTGCTGGGAGAACTGGAGGCAGAAGAAGAAGACCCTGCTGGTGGCCATTGACCGAG 

CCTGCCCAGAAAGCGGTCACCCTCGAGTCCTGGCTGATTCTTTTCCTGGCAGTTCCCCTTATGAGGGTTACAACT 

ATGGCTCCTTTGAGAATGTTTCTGGATCTACCGATGGTCTGGTTGACAGCGCTGGCACTGGGGACCTCTCTTACG 

GTT ACCAGGGCCGCTC CTTTGAACCTGTAGGTACTCGGC CC CGAGTGG ACTCCATGAGCTCTGTGGAGGAGGATG 

ACTACGACAC^TTGACCGACATCGATTCCGACAAGAATGTCATTCGCACCAAGCAATACCTCTATGTGGCTGACC 

TGGCACGGAAGGACAAGCGTGTTCTGCGGAAAAAGTACCAGATCTACTTCTGGAACATTGCCACCATTGCTGTCT 

TCTATGCCCTTCCTGTGGTGCAGCTGGTGATCACCTACCAGACGGTGGTGAATGTCACAGGGAATCAGGACATCT 

GCTACTACAACTTCCTCTGCGCCCACCGACTGGGCAATCTCAGCGCCTTCAACAACATCCTCAGCAACCTGGGGT 

ACATCCTGCTGGGGCTGCTTTTCCTGCTCATCATCCTGC^CGGGAGATGAACCACAACCGGGCCCTGCTGCGCA 

ATGACCTCTGTGCCCTGGAATGTGGGATCCCC^AACACTTTGGGCTTTTCTACGCCATGGGCACAGCCCTGATGA 

TGGAGGGGCTGCTCAGTGCTTGCTATCATGTGTGCCCCAACTATACCAATTTCCAGTTTGACA 

ACATGATCGCCGGACTCTGCATGCTGAAGCTCTACCAGAAGCGGCACCCGGACATCAACGCCAGCGCCTACAGTG 

CCTACGCCTGCCTGGCCATTGTCATCTTCTTCTCTGTGCTGGGCGTGGTCTTTGGCAAAGGGAACACGGCGTTCT 

GGATCGTCTTCTCCATCATTCACATCATCGCCACCCTGCTCCTCAGCACGCAGCTCTATTACATGGGCCGGTGGA 

AACTGGACTCGGGGATCTTCCGCCGCATCCTCCACGTGCTCTACACAGACTGCATCCGGCAGTGCAGCGGGCCGC 

TCTACGTGGACCGCATGGTGCTGCTGGTCATGGG CAACGTC ATCAACTGGTCG CTGG CTGCCTATGGGCTTATCA 

TGCGCCCCAATGATTTCGCTTCCTACTTGTTGGCCATTGGCATCTGCAACCTGCTCCTTTACTTCGCCTTCTACA 

TCATCATGAAGCTCCGGAGTGGGGAGAGGATCAAGCTCATCCCCCTGCTCTGCATCGTTTGCACCTCCGTGGTCT 

GGGGCTTCGCGCTCTTCTTCTTCTTCCAGGGACTCAGCACCTGGCAGAAAACCCCTGCAGAGTCGAGGGAGCACA 

ACCGGGACTGCATCCTCCTCGACTTCTTTGACGACCACGACATCTGGCACrTCCTCTCCTCCATCGCCATGTTCG 

GGTCCTTCCTGGTGTTGCTGACACTGGATGACGACCTGGATACTGTGCAGCGGGACAAGATCTATGTCTTCTAGC 

AGGAGCTGGGCCCrTCGCTTCACCTCAAGGGGCCCTGAGCTCCTTTGTGTCATAGACCGGTCACTCTGTCGTGCT 

GTGGGGATGAG TC CCAG CACCGCTGCCCAGCACTGGATGGCAGCAGGACAGCCAGGTCTAG CTTAGGCTTGGCCT 

GGGACAGCCATGGGGTGGCATGGAACCTTGCAGCTGCCCTCTGCCGAGGAGCAGGCCTGCTCCCCTGGAACCCCC 

AGATGTTGGCCAAATTGCTGCTTTCTTCTCAGTGTTGGGGCCTTCCATGGGCCCCTGTCCTTTGGCTCTCC 

GTCCCTTTGCAAGAGGAAGGATGGAAGGGACACCCTCCCCATTTCATGCC1TGCATTTTGCCCGTCCT 

ACAATGCCCCAGCCTGGGACCTAAGGCCTCTTTTTCCTCCCATACTCCCACTCCAGGGCCTAGTCTGGGGCCTGA 

ATCTCTGTCCTGTATCAGGGCC C C AGTTCTCTTTGGGCTGTCCCTGGCK3CCATCACTGCCCATTC CAGTCAGCC 

AGGATGGATGGGGGTATGAGATTTTGGGGGTTGGCCAGCTGGTGCCMACnT^ 

CCTGGGGCAGTGCGTATTCTCTTCCCTCTGACCTGTGCTCAGGGCTGGCTCTTTAGCAATGCGCTCAGCCCAATT 

TGAGAACCGCCTTCTGATTCAAGAGGCTGAATTCAGAGGTCACCTCTTCATCCCATCAGCTC 

AGCACCAGGACTGGAGGGAGAAGCGCCTCACCCCTTCCCTTCCTTCTTTCCAGGCCCTTAGTCTTGCCAAACCCC 

AGCTGGTGGCCTTTCAGTGCCATTGACACTGCCCAAGAATGTCC^GGGGCAAAGGAGGGATGATACAGAGTTCAG 

CCCGTTCTGCCTCCACAGCTGTGGGCACCCCAGTGC^ 

CCCTCTACGTGCCCAGTCCTAGCCTCGCTCTAGGACCCAGGGCTGGCTTCTAAGTTTCCGTCCAGTCTTCAGGCA 
AGTTCTGTGTTAGTCATGCAC^CACATACCTATGAAACCl^ 

CCTGGCCACCCTGGTCCTTGGATCCCCTTCGTCCCACCTGGTCCACCCCAGATGCTGAGGATGGGGGAGCTCAGG 
CGGGGCCTCTGCTTTGGGGATGGGAATGTGTTTTTCTCCCAA^ 

AGATGAGGTGGGTCTGGATCTTTTCTCAGAGCGTCTCCATGCTATGGTTGCATTTCCGTTTTCTATGAATGAATT 
TGCATTCAATAAACAACCAGACTCAAAAAAAAAAAAAAA 
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FIGURE 130 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA66659 
xsubunit 1 of l, 832 aa, 1 stop" 
><MW: 94454, pi: 6.94, NX(S/T): 12 

MFALGL PFLVLL VAS VE S HLG VLG PKNVSQKD AE FE RT YVDE VNS E LVNI YTFNHTVTRNRT 
EGVRVSVNVLNKQKGAPLLFWRQKEAWSFQVPLILRGMFQRKYLYQKVERTLCQPPTKNE 
SEIQFFYVDVSTLSPVNTTYQLRVSRMDDFVLRTGEQFSFNTTAAQEQYFKYEFPEGVDSVI 
VKVTSNKAFPCS VI S I QD VLCPVYDLDNNVAF IGMYQTMTKKAAI TVQRKDFPSNSFYVVVV 
VKTEDQACGGSLPFYPFAEDEPVDQGHRQKTLSVLVSQAVTSEAYVSGMLFCLGIFLSFYLL 
TVLLACWENWRQ KKKT LLVAI DRACPES GHPRVLADS F PGS S P YEG YNYG S FENVS GS TDGL 
VDSAGTGDLSYGYQGRSFEPVGTRPRVDSMSSVEEDDYDTLTDIDSDKNVIRTKQYLYVADL 
ARKDKRVLRKKYQIYFWNIATIAVFYALPVVQLVITYQTVVNVTGNQDICYYNFLCAHPLGN 
LSAFNNILSNLGYILLGLLFLLIILQREINHNRALLRNDLCALECGIPKHFGLFYAMGTALM 
MEGLLSACYHVCPNYTNFQFDTSFMYMIAGLCMLKLYQKRHPDINASAYSAYACLAIVIFFS 
VLG WFGKGNTAFW I VFS I IHI IATLLLSTQLYYMGRWKLDSGIFRRILHVLYTDCIRQCSG 
PLYVDRMVLLVMGOTINWSLAAYGLimPNDFASYLIAIGICNLLLYFAFYIIMKLRSGERI 
KLIPLIiCIVCTSWWGFALFFFFQGLSTWQKTPAESREHNRDCILLDFFDDHDIWHFLSSIA 
MFGSFLVLLTLDDDLDTVQRDKIYVF 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

Transmembrane domains: 

amino acids 292-317, 451-470, 501-520, 607-627, 751-770 

Leucine zipper pattern. 

amino acids 497-518 

N-glycosylation sites. 

amino acids 27-30, 54-57, 60-63, 123-126, 141-144, 165-168, 364- 
367, 476-479, 496-499, 572-575, 603-606, 699-702 
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FIGURE 131 

GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGC 
TCTCTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTG 
ACCA^GGTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGC 
CCAGCCTGCAGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACC 
TGACCAAGTTGCCGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCA 
GGCAAGGCAACTGAGGGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAG 
GGCCCTGGACCGAGAGGAGCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATG 
GACATGTCTTGTGGGGTCCACAGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTG 
CCCCATTTCTCTCAAGCCATCTACAGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCC 
CTTCCTCTTCCTTGAGGCTTCAGACCGGGATGAGCCAGGCACAGCCAACTCGGATCTTCGAT 
TCCACATCCTGAGCCAGGCTCCAGCCCAGCCTTCCCCAGACATGTTCCAGCTGGAGCCTCGG 
CTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGCACCAGCCTTGACCACGCCCTGGAGAGGAC 
CTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGACCAGGCCTCAGGCCACCAGGCCACTG 
CCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGTCCCTAGAGCCTATCCACCTGGCA 
GAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTACACTGGAGTGGG6GTGATGT 
GCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAATGCAGAGGGAAACCTCT 
ACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCCAGGTGCGGGCTCAG 
AATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTGATGGATGAGAA 
TGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCTCAGTCCAC 
CAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCAATTCC 
CACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTCCA 
GGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACA 
TCCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGT 
GAAGTCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGAT 
TGGGCCTATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCA 
TTGATGCTGACCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACA 
GAAGGGACTTTTGGCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAA 
GAACCTCAGTTATGAGGCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGA 
AGCTGGTGGGGCCAGGCCCAGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGA 
GTGATGCCACCCCCCAAGTTGGACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCC 
AGCCGGCTCTTTCCTGCTGACCATCCAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCT 
CCCTAGTCAATGACTCAGAGGGCTGGCTCTGCATTGAGAAATTCTCCGGGGAGGTGCACACC 
GCCCAGTCCCTGCAGGGCGCCCAGCCTGGGGACACCTACACGGTGCTTGTGGAGGCCCAGGA 
TACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAATACCTCTGCACACCCCGCCAAGACCATG 
GCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCTGGCCAGTGGGCACGGTCCCTACAGC 
TTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGCGCCTCCAGACTCTCAATGGTTC 
CCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGTGAACACATAATCCCCGTGG 
TGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGATCGTGTGTCGCTGCAAC 
GTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCACGAAGCTGTCGGC 
AGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTCATTTTCACCC 
ACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCTGAAGGCG 
ACTGTCTgAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAGTCCC 
CTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCAT 
CTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAAC 
TTTATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCA 
GAGAACTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 132 

MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSG 
KATEGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVP 
HFSQAIYRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRL 
GALALSPKGSTSLDHALERTYQLLVQVKDMGDQASGHQATATVEVS I IESTWVSLEPIHLAE 
NLKVL YPHHMAQVHWS GGD VHYHLES H P PG P F E VNAEGNLYVTRELDRE AQ AE YLLQ VRAQN 
SHGEDYAAPLELHVLVMDENDNVP I CP PRDPT VS I PELS PPGTEVTRLSAEDADAPGS PNSH 
WYQLLSPEPEDGVEGRAFQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCE 
VE VAVTD I NDHAPE F I TSQI GP I S LPEDVEPGTLVAMLTAI DADLE PAFRLMDFAI ERGDTE 
GTFGLDWEPDSGHVRLRLCKNLSYEAAPSHEVWWQSVAKLVGPGPGPGATATVTVLVERV 
MPPPKLDQESYEASVPISAPAGSFLLTIQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTA 
QSLQGAQPGDTYTVLVEAQDTALTLAPVPSQYLCTPRQDHGLIVSGPSKDPDLASGHGPYSF 
TLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEPREHI I PVWSHNAQMWQLLVRVI VCRCNV 
EGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGIFLILIFTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain : 

amino acids 762-784 
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FIGURE 133 

CCGGGGACA^GAGGTGGATACTGTTCATTGGGGCCCTTATTGGGTCCAGCATCTGTGGCCAA 
GAAAAATTTTTTGGGGACCAAGTTTTGAGGATTAATGTCAGAAATGGAGACGAGATCAGCAA 
ATTGAGTCAACTAGTGAATTCAAACAACTTGAAGCTCAATTTCTGGAAATCTCCCTCCTCCT 
TCAATCGGCCTGTGGATGTCCTGGTCCCATCTGTCAGTCTGCAGGCATTTAAATCCTTCCTG 
AGATCCCAGGGCTTAGAGTACGCAGTGACAATTGAGGACCTGCAGGCCCTTTTAGACAATGA 
AGATGATGAAATGCAACACAATGAAGGGCAAGAACGGAGCAGTAATAACTTCAACTACGGGG 
CTTACCATTCCCTGGAAGCTATTTACCACGAGATGGACAACATTGCCGCAGACTTTCCTGAC 
CTGGCGAGGAGGGTGAAGATTGGACATTCGTTTGAAAACCGGCCGATGTATGTACTGAAGTT 
CAGCACTGGGAAAGGCGTGAGGCGGCCGGCCGTTTGGCTGAATGCAGGCATCCATTCCCGAG 
AGTGGATCTCCCAGGCCACTGCAATCTGGACGGCAAGGAAGATTGTATCTGATTACCAGAGG 
GATCCAGCTATCACCTCCATCTTGGAGAAAATGGATATTTTCTTGTTGCCTGTGGCCAATCC 
TGATGGATATGTGTATACTCAAACTCAAAACCGATTATGGAGGAAGACGCGGTCCCGAAATC 
CTGGAAGCTCCTGCATTGGTGCTGACCCAAATAGAAACTGGAACGCTAGTTTTGCAGGAAAG 
GGAGCCAGCGACAACCCTTGCTCCGAAGTGTACCATGGACCCCACGCCAATTCGGAAGTGGA 
GGTGAAATCAGTGGTAGATTTCATCCAAAAACATGGGAATTTCAAGGGCTTCATCGACCTGC 
ACAGCTACTCGCAGCTGCTGATGTATCCATATGGGTACTCAGTCAAAAAGGCCCCAGATGCC 
GAGGAACTCGACAAGGTGGCGAGGCTTGCGGCCAAAGCTCTGGCTTCTGTGTCGGGCACTGA 
GTACCAAGTGGGTCCCACCTGCACCACTGTCTATCCAGCTAGCGGGAGCAGCATCGACTGGG 
CGTATGACAACGGCATCAAATTTGCATTCACATTTGAGTTGAGAGATACCGGGACCTATGGC 
TTCCTCCTGCCAGCTAACCAGATCATCCCCACTGCAGAGGAGACGTGGCTGGGGCTGAAGAC 
CATCATGGAGCATGTGCGGGACAACCTCTACTAGGCGATGGCTCTGCTCTGTCTACATTTAT 
TTGTACCCACACGTGCACGCACTGAGGCCATTGTTAAAGGAGCTCTTTCCTACCTGTGTGAG 
TCAGAGCCCTCTGGGTTTGTGGAGCACACAGGCCTGCCCCTCTCCAGCCAGCTCCCTGGAGT 
CGTGTGTCCTGGCGGTGTCCCTGCAAGAACTGGTTCTGCCAGCCTGCTCAATTTTGGTCCTG 
CTGTTTTTGATGAGCCTTTTGTCTGTTTCTCCTTCCACCCTGCTGGCTGGGCGGCTGCACTC 
AGCATCACCCCTTCCTGGGTGGCATGTCTCTCTCTACCTCATTTTTAGAACCAAAGAACATC 
TGAGATGATTCTCTACCCTCATCCACATCTAGCCAAGCCAGTGACCTTGCTCTGGTGGCACT 
GTGGGAGACACCACTTGTCTTTAGGTGGGTCTCAAAGATGATGTAGAATTTCCTTTAATTTC 
TCGCAGTCTTCCTGGAAAATATTTTCCTTTGAGCAGCAAATCTTGTAGGGATATCAGTGAAG 
GTCTCTCCCTCCCTCCTCTCCTGTTTTTTTTTTTTTTGAGACAGAGTTTTGCTCTTGTTGCC 
CAGGCTGGAGTGTGATGGCTCGATCTTGGCTCACCACAACCTCTGCCTCCTGGGTTCAAGCA 
ATTCTCCTGCCTCAGCCTCTTGAGTAGCTTGGTTTATAGGCGCATGCCACCATGCCTGGCTA 
ATTTTGTGTTTTTAGTAGAGACAGGGTTTCTCCATGTTGGTCAGGCTGGTCTCAAACTCCCA 
ACCTCAGGTGATCTGCCCTCCTTGGCCTCCCAGAGTGCTGGGATTACAGGTGTGAGCCACTG 
TGCCGGGCCCGTCCCCTCCTTTTTTAGGCCTGAATACAAAGTAGAAGATCACTTTCCTTCAC 
TGTGCTGAGAATTTCTAGATACTACAGTTCTTACTCCTCTCTTCCCTTTGTTATTCAGTGTG 
ACCAGGATGGCGGGAGGGGATCTGTGTCACTGTAGGTACTGTGCCCAGGAAGGCTGGGTGAA 
GTGACCATCTAAATTGCAGGATGGTGAAATTATCCCCATCTGTCCTAATGGGCTTACCTCCT 
CTTTGCCTTTTGAACTCACTTCAAAGATCTAGGCCTCATCTTACAGGTCCTAAATCACTCAT 
CTGGCCTGGATAATCTCACTGCCCTGGCACATTCCCATTTGTGCTGTGGTGTATCCTGTGTT 
TCCTTGTCCTGGTTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTTTGTGTGTGTGTG 
TCTGTCTATTTTGTATCCTGGACCACAAGTTCCTAAGTAGAGCAAGAATTCATCAACCAGCT 
GCCTCTTGTTTCATTTCACCTCAGCACGTACCATCTGTCCTTTTGTTGTTGTTGTTTTGTTT 
TTGTTTTTTTGCTTTTACCAAACATGTCTGTAAATCTTAACCTCCTGCCTAGGATTTGTACA 
GCATCTGGTGTGTGCTTATAAGCCAATAAATATTCAATGTGAAAAAAAAAAAAAAAAA 
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FIGURE 134 

MRWILFIGALIGSSICGQEKFFGDQVLRINVRNGDEISKLSQLVNSNNLKLNFWKSPSSFNR 
PVDVLVPSVSLQAFKSFLRSQGLEYAVTIEDLQALLDNEDDEMQHNEGQERSSNNFNYGAYH 
SLEAIYHEMDNIAADFPDLARRVKIGHSFENRPMYVLKFSTGKGVRRPAVWLNAGIHSREWI 
SQATAI WTARKI VSDYQRDPAI TS ILEKMD I FLLPVANPDGYVYTQTQNRLWRKTRSRNPGS 
SCIGADPNRNWNASFAGKGASDNPCSEVYHGPHANSEVEVKSWDFIQKHGNFKGFIDLHSY 
SQLLMYPYGYSVKKAPDAEELDKVARLAAKALASVSGTEYQVGPTCTTVYPASGSSIDWAYD 
NGIKFAFTFELRDTGTYGFLLPANQIIPTAEETWLGLKTIMEHVRDNLY 

Signal peptide: 
amino acids 1-16 



WO 00/12708 



PCTAJS99/201H 



FTGTJRE 135 

CAACCATGCAAGGACAGGGCAGGAGAAGAGGAACCTGCAAAGACATATTTTGTTCCAAAATG 

GCATCTTACCTTTATGGAGTACTCTTTGCTGTTGGCCTCTGTGCTCCAATCTACTGTGTGTC 

CCCGGCCAATGCCCCCAGTGCATACCCCCGCCCTTCCTCCACAAAGAGCACCCCTGCCTCAC 

AGGTGTATTCCCTCAACACCGACTTTGCCTTCCGCCTATACCGCAGGCTGGTTTTGGAGACC 

CCGAGTCAGAACATCTTCTTCTCCCCTGTGAGTGTCTCCACTTCCCTGGCCATGCTCTCCCT 

TGGGGCCCACTCAGTCACCAAGACCCAGATTCTCCAGGGCCTGGGCTTCAACCTCACACACA 

CACCAGAGTCTGCCATCCACCAGGGCTTCCAGCACCTGGTTCACTCACTGACTGTTCCCAGC 

AAAGACCTGACCTTGAAGATGGGAAGTGCCCTCTTCGTCAAGAAGGAGCTGCAGCTGCAGGC 

AAATTTCTTGGGCAATGTCAAGAGGCTGTATGAAGCAGAAGTCTTTTCTACAGATTTCTCCA 

ACCCCTCCATTGCCCAGGCGAGGATCAACAGCCATGTGAAAAAGAAGACCCAAGSGAAGGTT 

GTAGACATAATCCAAGGCCTTGACCTTCTGACGGCCATGGTTCTGGTGAATCACATTTTCTT 

TAAAGCCAAGTGGGAGAAGCCCTTTCACCTTGAATATACAAGAAAGAACTTCCCATTCCTGG 

TGGGCGAGCAGGTCACTGTGCAAGTCCCCATGATGCACCAGAAAGAGCAGTTCGCTTTTGGG 

GTGGATACAGAGCTGAACTGCTTTGTGCTGCAGATGGATTACAAGGGAGATGCCGTGGCCTT 

CTTTGTCCTCCCTAGCAAGGGCAAGATGAGGCAACTGGAACAGGCCTTGTCAGCCAGAACAC 

TGATAAAGTGGAGC CACT CACTCCAGAAAAGGTGGATAGAGGTGTT CATCCCCAGATTTTCC 

ATTTCTGC CTCCTACAAT CTGGAAACCATC CTCCCGAAGATGGGCATCCAAAATGCCTTTGA 

CAAAAATGCTGATTTTTCTGGAATTGCAAAGAGAGACTCCCTGCAGGTTTCTAAAGCAACCC 

ACAAGGCTGTGCTGGATGTCAGTGAAGAGGGCACTGAGGCCACAGCAGCTACCACCACCAAG 

TTCATAGTCCGATCGAAGGATGGTCCCTCTTACTTCACTGTCTCCTTCAATAGGACCTTCCT 

GATGATGATTACAAATAAAGCCACAGACGGTATTCTCTTTCTAGGGAAAGTGGAAAATCCCA 

CTAAATCCTAGGTGGGAAATGGCCTGTTAACTGATGGCACATTGCTAATGCACAAGAAATAA 

CAAACCACATCCCTCTTTCTGTTCTGAGGGTGCATTTGACCCCAGTGGAGCTGGATTCGCTG 

GCAGGGATGCCACTTCCAAGGCTCAATCACCAAACCATCAACAGGGACCCCAGTCACAAGCC 

AACACCCATTAACCCCAGTCAGTGCCCTTTTCCACAAATTCTCCCAGGTAACTAGCTTCATG 

GGATGTTGCTGGGTTACCATATTTCCATTCCTTGGGGCTCCCAGGAATGGAAATACGCCAAC 

CCAGGTTAGGCACCTCTATTGCAGAATTACAATAACACATTCAATAAAACTAAAATATGAAT 

TCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

AAAAAAA 



WO 00/12708 



PCT/US99/20111 

* 



FIGURE 136 

MASYLYGVLPAVGLCAPIYCVSPANAPSAYPRPSSTKSTPASQVYSLNTDFAFRLYRRLVLE 
TPSQNIFFSPVSVSTSLAMLSLGAHSVTKTQILQGLGFNLTHTPESAIHQGFQHLVHSLTVP 
SKDLTLKMGSALFVKKELQLQANFLGNVKRLYEAEVFSTDFSNPSIAQARINSHVKKKTQGK 
VVDIIQGLDLLTAMVLVNHIFFKAKWEKPFHLEYTRKNFPFLVGEQVTVQVPMMHQKEQFAF 
GVDTELNCFVLQMDYKGDAVAFFVLPSKGKMRQLEQALSARTLIKWSHSLQKRWIEVFIPRF 
S I S ASYNLET I LPKMG I QNAFDKNADFS G I AKRD SLQVSKATHKAVLDVSEEGTEATAATTT 
KFIVRSKDGPSYFTVSFNRTFLMMITNKATDGILFLGKVENPTKS 

Signal peptide: 

amino acids 1-20 



/37/2- 7C > 



WO 00/12708 



PCT/US99/20U1 



FIGURE 137 

GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGC 
CTGAGTCCAAGATTCTTCCCAGGAACACAAACGTAGGAGACCCACGCTCCTGGAAGCACCAG 
CCTTTATCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAG 
TCTTGGTACATCTAGGACCCAGGCATCTTGCTTTCCAGCCACAAAGAGACAGATGAAGATGC 
AGAAAGGAAATGTTCTCCTTATGTTTGGTCTACTATTGCATTTAGAAGCTGCAACAAATTCC 
AATGAGACTAGCACCTCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGC 
CACCAACTCTGGGTCCAGTGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCA 
GCGTGACCTCCAATGGGGTCAGCATAGTCACCAACTCTGAGTTCCATACAACCTCCAGTGGG 
ATCAGCACAGCCACCAACTCTGAGTTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAA 
CTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAC 
CCTCCAGTGGGGCCAGCACAGTCACCAACTCTGGGTCCAGTGTGACCTCCAGTGGAGCCAGC 
ACTGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACTGCCACCAACTCTGA 
GTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCA 
GTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCC 
ACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACTGCCACCAACTCTGAGTCCAG 
CACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGAACGACCTCCAATGGGG 
CTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAAC 
TCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACGAC 
CTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCTAGCA 
CAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACTCTGAG 
TCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCCAG 
TGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCA 
CCAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGC 
ACAACCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGC 
TAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAACT 
CTGAGT CTAGCACAGTGT CCAGTGGGATCAGCACAGT CACCAATTCTGAGTCCAGCACAACC 
TCCAGTGGGGCCAACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAAC 
AGCAGCTCTGACTGGAATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAA 
AGCCTGGTGGGTCCCTGGTGCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCG 
GCCGTGGGGCTCTTTGCTGGGCTCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACAC 
CTTTAACACAGCTGTCTACCACCCTCATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAG 
GGAATCATGGAGCCCCCCACAGGCCCAGGTGGAGTCCTAACTGGTTCTGGAGGAGACCAGTA 
TCATCGATAGCCATGGAGATGAGCGGGAGGAACAGCGGGCCCTGAGCAGCCCCGGAAGCAAG 
TGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCACCCAAGACCTGGTTTCCTTTCATTCATC 
CCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAAAATCTTGAAGAAGGTATTCCTCACC 
TTTCTTGCCTTTACCAGACACTGGAAAGAGAATACTATATTGCTCATTTAGCTAAGAAATAA 
ATACATCTCATCTAACACACACGACAAAGAGAAGCTGTGCTTGCCCCGGGGTGGGTATCTAG 
CTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTCCATCTGGCATTCAAAA 
TCTCCACAGTAAAATCCAAAGACCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAA 
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FIGURE 138 

MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATI 
SGSSVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSE 
SSTPSSGASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSS 
TTSSGASTATNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTT 
SNGAGTATNSESSTTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSS 
GASTATNSDSSTTSSGAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGA 
NTATNSESSTVSSGASTATNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEAST 
ATNSESSTVSSGISTVTNSESSTTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAV 
SEAKPGGSLVPWEIFLITLVSWAAVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGP 
GPGGNHGAPHRPRWSPNWFWRRPVSSIAMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 



WO 00/12708 



PCT/US99/20111 



FTGTJRE 139 

GGGAGAGAGGATAAATAGCAGCGTGGCTTCCCTGGCTCCTCTCTGCATCCTTCCCGACCTTC 
CCAGCAATATGCATCTTGCACGTCTGGTCGGCTCCTGCTCCCTCCTTCTGCTACTGGGGGCC 
CTGTCTGGATGGGCGGCCAGCGATGACCCCATTGAGAAGGTCATTGAAGGGATCAACCGAGG 
GCTGAGCAATGCAGAGAGAGAGGTGGGCAAGG CCCTGGATGG CATC AACAGTGGAATCACGC 
ATGCCGGAAGGGAAGTGGAGAAGGTTTTCAACGGACTTAGCAACATGGGGAGCCACACCGGC 
AAGGAGTTGGACAAAGGCGTCCAGGGGCTCAACCACGGCATGGACAAGGTTGCCCATGAGAT 
CAACCATGGTATTGGACAAGCAGGAAAGGAAGCAGAGAAGCTTGGCCATGGGGTCAACAACG 
CTGCTGGACAGGCCGGGAAGGAAGCAGACAAAGCGGTCCAAGGGTTCCACACTGGGGTCCAC 
CAGGCTGGGAAGGAAGCAGAGAAACTTGGCCAAGGGGTCAACCATGCTGCTGACCAGGCTGG 
AAAGGAAGTGGAGAAGCTTGGCCAAGGTGCCCACCATGCTGCTGGCCAGGCCGGGAAGGAGC 
TGCAGAATGCTCATAATGGGGTCAACCAAGCCAGCAAGGAGGCCAACCAGCTGCTGAATGGC 
AACCATCAAAGCGGATCTTCCAGCCATCAAGGAGGGGCCACAACCACGCCGTTAGCCTCTGG 
GGCCTCAGTCAACACGCCTTTCATCAACCTTCCCGCCCTGTGGAGGAGCGTCGCCAACATCA 
TGCCCTAAACTGGCATCCGGCCTTGCTGGGAGAATAATGTCGCCGTTGTCACATCAGCTGAC 
ATGACCTGGAGGGGTTGGGGGTGGGGGACAGGTTTCTGAAATCCCTGAAGGGGGTTGTACTG 
GGATTTGTGAATAAACTTGATACACCA 



WO 00/12708 



PO7US99/201H 



FTGIJRE 140 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA66675 
xsubunit l of 1, 247 aa, 1 stop 
><MW: 25335, pi: 7.00, NX(S/T): 0 

MHLARLVGS CSLLLLLGALSGWAASDDP I EKV I EG I NRGLSNAERE VGKALDG INSGITHAG 
REVEKVFNGLSNMGSHTGKELDKGVQGLNHGMDKVAHEINHGIGQAGKEAEKLGHGVNNAAG 
QAGKEADKAVQGFHTGVHQAGKEAEKLGQGVNHAADQAGKEVEKLGQGAHHAAGQAGKELQN 
AHNGVNQASKEANQLLNGNHQSGSSSHQGGATTTPLASGASVNTPFINLPALWRSVANIMP 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

Homologous region to circumsporozoite (CS) repeats: 
amino acids 35-225 
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FIGURE 141 

CTCCGGGTCCCCAGGGGCTGCGCCGGGCCGGCCTGGCAAGGGGGACGAGTCAGTGGACACTCCAGGAAGAGCGGC 
CCCGCGGGGGGCGATGACCGTGCGCTGACCCTGACTCACTCCAGGTCCGGAGGCGGGGGCCCCCGGGGCGACTCG 
GGGGCGGACCGCGGGGCGGAGCTGCCGCCCGTGAGTCCGGCCGAGCCACCTGAGCCCGAGCCGCGGGACACCGTC 
GCTCCTGCTCTCCGAATGCTGCGCACCGCGATGGGCCTGAGGAGCTGGCTCGCCGCCCCATGGGGCGCGCTGCCG 
CCTCGGCCACCGCTGCTOCTGCTCCTGCTGCTGCT^ 

CCCCGGATCAGCCTGCCTCTGGGCTCTGAAGAGCGGCCATTCCTCAGATTCGAAGCTGAACACATCTCCAACTAC 
AC AG CC CTTCTG CTX3AGCAG<^ ATGGCAGG ACCCTGT ACGTGGGTGCTCGAG AGGCCCTCTTTG CACTCAGTAGC 
AACCTCAGCTTCCTGCCAGGCGGGGAGTACCAGGAGCTGCTTTGGGGTGCAGACGCAGAGAAGAAACAGCAGTGC 
AGCTTCAAGGGCAAGGACCCACAGCGCGACTGTCAAAACTACATCAAGAT^ 

CTGTT CACCTGTG GCACAGCAG CCTTCAGCCCCATGTGT ACCTACATCAACATGGAGAACTTCACCCTGGCAAGG 
GACGAGAAGGGGAATGTCCTCCTGGAAGATGGCAAGGGCCGTTGTCCCTTCGACCCGAATTTCAAGTCCACTGCC 
CTGGTGGTTGATGGCGAGCTCTACACTGGAACAGTCAGCAGCTTCCAAGGGAATGACCCGGCCATCTCGCGGAGC 
CAAAGCCTTCGCCCCACCAAGACCGAGAGCTCCCTCAACTGGCTGCAAGACCCAGCTTTT 

ATTCCTGAGAGCCTGGGCAGCTTGCAAGGCGATGATGACAAGATCTACTTTTTCTTCAGCGAGACTGGCCAGGAA 
TTTGAGTTCTTTGAGAACACCATTGTGTCCCGCATTGCCCGCATCTGCAAGGGCGATGAGGGTGGAGAGCGGGTG 
CTACAGCAGCG CTGG ACCT CCTTCCT CAAGGCCCAGCTGCTGTGCTCACGGCCCGACGATGGCTTC CCCTTCAAC 
GTGCTGCAGGATGTCTTCACGCTG AGCCCCAGCC CCC AGGACTGG CGTGACACC CTTTTCTATGGGGTCTT CACT 
TCCCAGTGGCACAGGGGAACTACAGAAGGCTCTGCCGTCTGTGTCTTCACAATGAAGGATGTGCAGAGAGTCTTC 
AGCGGCCTCTACAAGG AGGTG AACCGTGAGACACAG C AGTGGTACACCGTGACC CACCCGGTGCCCACACCC CGG 
CCTGGAGCGTGCATCACCAACAGTGCCCGGGAAAGGAAGATCAACTCATCCCTGCAGCTCCCAGACCGCGTGCTG 
AACTTCCTC^GGACCACTTCCTGATGGACGGGCAGGTCCGAAGCCG(^TGCTGCTGCTGCAGCCCCAGGCTCGC 
TACCAGCG CGTGGCTGT AC AC CG CGTC CCTGGCCTGCACC ACAC CTACGATGTCCTCTTCCTGGGCACTGGTGAC 
GGCCGGCTCCACAAGGCAGTGAGCGTGGGCCCCCGGGTGCACATCATTGAGGAGCTGCAGATCTTCTCATCGGGA 
CAGCCCGTGCAGAATCTGCTCCTGGACACCCACAGGGGGCTGCTGTATGCGGCCTCACACTCGGGCGTAGTCCAG 
GTGCCCATGGCCAACTGCAGCCTGTACCGGAGCTGTGGGGACTGCCTCCTCGCCCGGGACCCCTACTGTGCTTGG 
AGCGGCTCCAGCTGO^AGCACGTCAGCCTCTACCAGCCTCAGCTGGCCACCAGGCCGTGGATCCAGGACATCGAG 
GGAGCCAGCGCCAAGGACCTTTGCAGCGCGTCTTCGGTTGTGTCCCCGTCTTTTGTACCAACAGGGGAGAAGCCA 
TGTGAGCAAGTCC^GTTCCAGCCCAACACAGTGAAGACTTT 

CTCTGGCTACGCAACGGGGCCCCCGTCAATGCCTCGGCCTCCTGCCACGTGCTACCCACTGGG^ACCTGCTGCTG 
GTGGGCACCCAACAGCTGGGGGAGTTCC^GTGCTGGTCACTAGAGGAGGGCTTCC^GCAGCrGGTAGCCAGCTAC 
TGCCCAGAGGTGGTGGAGGACGGGGTGGCAGACCAAACAGATGAGGGTGGCAGTGTACCCGTCATTATCAGCACA 
TCGCGTGTGAGTGCACC^GCTGGTGGCAAGGCCIAGCTGGGGTGCAGACAGGTCCTACTGGAAGXsAGTTCCTGGTG 
ATGTGCACGCTCTTTGTGCTGGCCGTGCTGCTCCCAGTTTTATTCT 

GTCTTCCTGAAGCAGGGGGAATGTGCCAGCGTGCACCCCAAGACCTGCCCTGTGGTGCTGCCCCCTGAGACCCGC 
CCACTCAACGGCCTAGGGCCCCCTAG CACCCCGCTCGATCACCGAGGGTACCAGT CCCTGT CAGACAGCCCCCCG 
GGGGCCCGAGTCTTCACTGAGTCAGAGAAGAGGCCACTCAGCATCC^ 

TGCCCCCGGCCCCGGGTCCGCCrTGGCTCGGAGATCCGTGACTCTGTGGTGTGAGAGCTGACTTCCAGAGGACGC 

TGCCCTGGCTTC^GGGGCTGTGAATGCTCGGAGAGGGTC^CrrGGACCTCCCCTCCGCTCrGCTCTTCGTGGAAC 

ACX^CCGTGGTGCCCGGCCCTTGGGAGCCTTGGAGCCAGCTGGCCTGCTGCTCTCCAGTCAAGTAGCGAAGCTCC 

TACCACCCAGACACCCAAACAGCCGTGGCCCCAGAGGTCCTGGCCAAATAT^ 

CAGTGCTCCTTATGTAAACTGAGCCCTTTGTTTAAAAAACAATTCCAAATGTGAAACTAGAATGA^ 

ATAGCATGGCATGCAG CAC AC ACGGCTGCTC CAGTTCATGGCCTCCCAGGGGTGCTGGGGATGCATCCAAAGTGG 

TTGTCTGAGACAGAGTTGGAAACCCTCACCAACTGGCCTCTTCACCTTCCACATTATCCCGCTGCCACCGGCTGC 

CCTGTCTCACTGCAGATTC^GGACCAGCTTGGGCTGCGTGCGTTCTGCCTTGCCAGTCAGCCGAGGATGTAGTTG 

TTGCTGCCGTCGTCCCACCACCTCAGGGACCIAGAGGGCTAGGTTG^ 

GGACCCMCTCCTGGACCTTTCCAGCCTGTATCAGGCTGTGGCCACACGAGAGGAa^GCGCGAGCTCAGGAGAGA 
TTTCGTGACAATGTACGCCTTTCCCTC^GAATTGAGGGAAGAGACTC 

GAACCCGTGTGCCCCTTC CCACC AT ATCC AC CCTCGCTCCATCTTTGAACTCAAACACG AGGAACTAACTGCAC C 
CTGGTCCTCTCCCCAGTCCCCAGTTCACCCTCCATCCCTC^CCrTCCTCCACTCTAAGGGATATC^CACTGCCC 
AGCACAGGGGCCCTGAATTTATGTGGTTTTTATACATTTTTTAATAAGATGC^CTTTATG 
GT CTG AAG AATT ACTGTT T AAAAAAAAAAAA 



WO 00/12708 



PCT/US99/20111 



FTGTJRE 142 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA67962 
xsubunit 1 of 1, 837 aa, 1 stop 
><MW: 92750, pi: 7.04, NX(S/T): 6 

MLRTAMGLRSWLAAPWGALPPRPPLLLLLLLLLLLQPPPPTWALSPRISLPLGSEERPFLRF 
EAEH I SNYTALLLSRDGRTLYVGARE ALFALS SNLS FLPGGE YQELLWGADAEKKQQCSFKG 
KDPQRDCQNYIKILLPLSGSHLFTCGTAAFSPMCTYINMENFTLARDEKGNVLLEDGKGRCP 
FDPNFKSTALWDGELYTGTVS SFQGNDPAI SRSQSLRPTKTESSLNWLQDPAFVASAYI PE 
SLGSLQGDDDKIYFFFSETGQEFEFFENTIVSRIARICKGDEGGERVLQQRWTSFLKAQLLC 
SRPDDGFP FNVLQDVFTLS PS PQDWRDTLFYGVFTSQWHRGTTEGS AVCVFTMKDVQRVFSG 
LYKEVNRETQQWYTVTHPVPTPRPGACITNSARERKINSSLQLPDRVLNFLKDHFLMDGQVR 
SRMLLLQPQARYQRVAVHRVPGLHHTYDVLFLGTGDGRLHKAVSVGPRVHIIEELQIFSSGQ 
PVQNLLLDTHRGLLYAASHSGWQVPMANCSLYRSCGDCLLARDPYCAWSGSSCKHVSLYQP 
QLATRPWIQDIEGASAKDLCSASSWSPSFVPTGEKPCEQVQFQPNTVNTLACPLLSNLATR 
LWLRNGAPVNASASCHVLPTGDLLLVGTQQLGEFQCWSLEEGFQQLVASYCPEWEDGVADQ 
TDEGGSVPVI I STSRVSAPAGGKASWGADRS YWKEFLVMCTLFVLAVLLPVLFLLYRHRNSM 
KVFLKQGECASVHPKTCPWLPPETRPLNGLGPPSTPLDHRGYQSLSDSPPGARVFTESEKR 

PLSIQDSFVEVSPVCPRPRVRLGSEIRDSW 

Transmembrane domains: 

amino acids 23-46 (type II), 718-738 



WO 00/12708 



PCI7US99/20111 



FTGTTRE 143A 

CTAAGCCGGAGGATGTGCAGCTGCGGCGGCGGCGCCGGCTACGAAGAGGACGGGGACAGGCGCCGTGCGAACCGA 

GCCCAGCCAGCCGGAGGACGCGGGCAGGGCGGGACGGGAGCCCGGACTCGTCTGCCGCCGCCGTCGTCGCCGTCG 

TGCCGGCCCCGCGTCCCCGCGCGCGAGCGGGAGGAGCCGCCGCCACCTCGCGCCCGAGCCGCCGCTAGCGCGCGC 

CGGGCATGGTCCCCTCTTAAAGGCGCAGGCCGCGGCGGCGGGGGCGGGTGTGCGGAACAAAGCGCCGGCGCGGGG 

CCTGCGGGCGGCTCGGGGGCCGCGATGGGCGCGGCGGGCCCGCGGCGGCGGCGGCGCTGCCCGGGCCGGGCCTCG 

CGGCGCTAGGGCGGGCTGGCCTCCGTGGGCXX^GGCAGCGGGCTGAGGGCGCGCGGAGCCTGCGGCGGCGGCGGC 

GGCGGCGGCGGCGGCCCGGCGGGCGGAGCGGCGCGGGCAjnGGCCGCGCGCGGCCGGCGCGCCTGGCTCAGCGTGC 

TGCTCGGGCTCGTCCTGGGCTTCGTGCTGGCCTCGCGGCTCGTCCTGCCCCGGGCrrTCCGAGCTGAAGCGAGCGG 

GCCCACGGCGCCGCGCCAGCCCCGAGGGCTGCCGGTCCGGGCAGGCGGCGGCTTCCCAGGCCGGCGGGGCGCGCG 

GCGATGCGCGCGGGGCGCAGCTCTGGCCGCCCGGCTCGGACCCAGATGGCGGCCCGCGCGACAGGAACTTTCTCT 

TCGTGGGAGTCATGACCGCCCAGAAATACCTGCAGACTCGGGCCGTGGCCGCCTACAGAACATGGTCCAAGACAA 

TTCCTGGGAAAGTTCAGTTCTTCTCAAGTGAGGGTTCTGACACATCTGTACCAATTCCAGTAGTGCCACTACGGG 

GTGTGGACGACTCCTACCCGCCCCAGAAGAAGTCCTTCATGATGCTCAAGTACATGCACGACCACTACTTGGAC^ 

AGTATGAATGGTTTATGAGAGCAGATGATGACGTGTACATCAAAGGAGACCGTCTGGAGAACTTCCTGAGGAGTT 

TGAACAGCAGCGAGCCCCTCTTTCTTGGGCAGACAGGCCTGGGCACCACGGAAGAAATGGGAAAACTGGCCCT 

AGCCTGGTGAGAACTTCTGCATGGGGGGGCCTGGCGTGATCATGAGCCGGGAGGTGCTTCGGAGAATGGTGCCGC 

ACATTGGCAAGTGTCTCCGGGAGATGTACACCACCCATGAGGACGTGGAGGTGGGAAGGTGTGTCOGGAGGTTTG 

CAGGGGTGCAGTGTGTCTGGTCTTATGAGATGCGGCAGCTTTTTTATGAGAATTACGAGCAGAACAAAAAGGGGT 

ACATTAGAGATCTCCATAACAGTAAAATTCACCyiAGCTATCACATTACACCCCAACAAAAACCCACCCTACCAGT 

ACAGGCTCCACAGCTACATG CTGAGCCGCAAGATATCCGAGCTCCG CC ATCGCACAATACAG CTGCACCGCGAAA 

TTGTCCTGATGAGCJ^TACAGCAACACAGAAATTC^TAAAGAGGACCrrCCAGCTGGGAATC.CCTCCCTCCTTCA 

TGAGGTTTCAGCCCCGCCAGCGAGAGGAGATTCTGGAATGGGAGTTTCTGACTGGAAAATACTTGTATTCGGCAG 

TTGACGG CCAGCCCCCTCGAAGAGGAATGG ACTCCGCCCAGAGGG AAGC CTTGGACGACATTGTCATGCAGGTCA 

TGGAGATGATCAATGCCAACGCC^GACCAGAGGGCGCATCATTGACTTCAAAGAGATCCAGTACGGCTACCGCC 

GGGTGAACC CCATGTATGGGGC TG AGTACATCCTGGACCTGCTGCTTCTGTACAAAAAGCACAAAGGGAAGAAAA 

TGACGGTCCCTGTGAGGAGGCACGCGTATTTACAGCAGACTTTCAGCAAAATCCAGTTTGTGGAGCATGA 

TGGATGCACAAGAGTTGGCCAAGAGAATCAATCAGGAATCTGGATCCTTGTCCTT^ 

AGCTCGTCCCCTTTCAGCTCCCTGX3GTCGAAGAGTGAGCACAAAGAACCCAAAGATAAAAAGATAAACATACTGA 
TTCCTTTGTCTGGGCGTTTCGACATGTTTGTGAGATTTATGGGAAACTTTGAGAAGACGTGTCTTATCCCCAA 
AGAACGTCAAGCTCGTGGTTCTGCTTTTCAATTCTGACTCCAACCCTGACAAGGCCAAACAAGTTGAACTGATGA 
GAGATTACCGCATTAAGTACCCTAAAGCCGACATGCAGATTTTGCC^ 

CCCTGGAAGTAGGATCCTCCCAGTTTAACAATGAATCTITGCTCTTCTTCTGCGACGTCGACCTCGTGTTTACTA 
CAGAATTCCTTGAGCGATGTCGAGCAAATACAGTTCTGGGCCAACAAATATATTTTCCA 
ATGACCOUUGATTGTTTATAGTGGGAAAGTTCCCAGTGACAACCATTTTGCCTTTACTCAGA 
GGAGAAACTATGGGTTTGGCATCACGTGTATTTATAAGGGAGATC^ 

TCCAAGGCTGGGGGCTGGAGGATGTGGACCTTTTCAACAAGGTTGTCCAGGCAGGTTTGAAGACGT^ 
AGGAAGTAGGAGTAGTCCACGTCCACCATCCTGTCTTTTGTGA 

GCTTGGGGTCCAAAGCT^TCGACCTATGGGTCCACCCAGCAGCTGGCTGAGATGTGGCTGGAAAAAAATGATCCAA 
GTTACAGTAAAAGCAGCAATAATAATGGCTCAGTGAGGACAGCCTAATGTCCAGCTTTGCTGGAAAAGACG 
TAATTATCTAATTTATTTTTCAAAAATTTTTTGTATGATCAGTTTTTGAAGTCC^ 
AAGTGGTTTTCTTACATAGGACTCCTTTAAGATTGAGCTT^ 

CACATCTTCTTGCTGAACATTATGTAGCAGACCTGCTTAACTTTGACTTGAAATGTACCTGATGAACAAAACTTT 

TTTAAAAAAATGTTTTCTTTTGAGACCCTTTGCTCCAGTCCTATGGCAGAAAACGTGAACATTCCT 

TATTGTAACAAAACACTGTAACTCTGGTAAATGTTCTGTTGTGATTGTTAACATTCCACAGATTCTACCTTTTGT 

GTTTTGTTTTTTTTTTTTACAATTGTTTTAAAGCCATTTCATGTTCCAGT^ 

GCTGTTTCATCATTGTCTTCAGGAGAGCTTTCCAGAGTTGATCATTTCCTCTCATGGTACTCTGCT 
CACGTAGGTTTTTTGTTTGTTTTGTTTTGTTCTTT'nTl'GAGACGGAGTCTCACTCTGTTACCCAGGCTGGAATG 

CAGTGGCGCAATCTTGGCTCACTTTAACCTCCACTTCCCTG^ 

AGCTGGGATTACAGGCACACACCACCACGCCCAGNTAGTTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCAT 

GCAAGCCCAGCTGGCCACGTAGGTTTTAAAGCAAGGGGCGTGAAGAAGGCACAGTGAGGTATGTGGCTGTTCTCG 

TGGTAGTTCATTCGGCCTAAATAGACCTGGCATTAAATTTCAAGAAGGATTTGGCATTTTCT 

CTCTTTAAAGGGTAAAATATTAATGTTTAGAATGACAAAGATGAATTATTACAATAAA 

GAAACATACACACATACACCCTAATCAAAACGTTGGGGAAAAATGTATTTGGTTTTGTTCCTTT 

TGTTATGTGGGTGGAGATGGTTTTCATTCTTTCATTACTGTTTTGTTTTATCCTTTGTATCT 

TTTATTTAATATCTGTTGTTCAGAGCTCTGCCATTTCTO 

GAGTGTGTTTAGTCTGTTTTATTTGCAGTAAACCGATCTCCAAAGATTTCCTTTT 



WO 00/12708 



PCT/US99/20111 



FTGTJRE 143B 



TTAATTTTTATATTCCTTACTGTTTTACTA^ 

GACAAAAGTGAAATGAATCTGTCATTATACCAGAAAGTTAAATTCT^ 

TTCATTTAGATTTCAAACAGTGATAGACTTGCCATTT^^ 

CCTGATGCTCATTTGGAAAAATAAACCAGTGAACAATATTT^ 

ATTCCTGTTTTAGCTGAAGAATTGTATTACATTTGGAGAGTAAAAAACTTAAACACGAAAAAA 
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FIGURE 144 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA68836 
xsubunit 1 of 1, 802 aa, 1 stop 
><MW: 91812, pi: 9.52, NX{S/T): 3 

MAARGRl^WLSVLLGLVLGFVLASRLVLPRASELKRAGPRRRASPEGCRSGQAAASQAGG^ 
GDARGAQLWPPGSDPDGGPRDRNFLFVGVMTAQKYLQTRAVAAYRTWSKTIPGKVQFFSSEG 
SDTSVPIPWPLRGVDDSYPPQKKSFMMLKYMHDHYLDKYEWFMRADDDVYIKGDRLENFLR 
SLNSSEPLFLGQTGLGTTEEMGKLALEPGENFCMGGPGVIMSREVLRRMVPHIGKCLREMYT 
THEDVEVGRCVRRFAGVQCVWSYEMRQLFYENYEQNKKGYIRDLHNSKIHQAITLHPNKNPP 
YQYRLHSYMLSRKISELRHRTIQLHREIVLMSKYSNTEIHKEDLQLGIPPSFMRFQPRQREE 
ILEWEFLTGKYLYSAVDGQPPRRGMDSAQREALDDIVMQVMEMINANAKTRGRIIDFKEIQY 
GYRRVNPMYGAEYILDLLLLYKKHKGKKMTVPVRRHAYLQQTFSKIQFVEHEELDAQELAKR 
INQESGSLSFLSNSLKKLVPFQLPGSKSEHKEPKDKKINILIPLSGRFDMFVRFMGNFEKTC 
LIPNQNVKLWLLFNSDSNPDKAKQVELMRDYRIKYPKADMQILPVSGEFSRAIjALEVGSSQ 
FNNESLLFFCDVDL VFTTEFLQRCRANTVLGQQ I YFP 1 1 FSQ YDPK I VYSGKVPSDNHFAFT 
QKTGFWRNYGFG I TCI YKGDLVRVGGFDVS IQGWGLEDVDLFNKWQAGLKTFRSQEVGWH 
VHHPVFCDPNLDPKQYKMCLGSKASTYGSTQQIAEMWLEKNDPSYSKSSNNNGSVRTA 

Signal peptide: 

amino acids 1-23 
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FIGURE 145 

GGACAACCGTTGCTGGGTGTCCCAGGGCCTGAGGCAGGACGGTACTCCGCTGACACCTTCCC 
TTTCGGCCTTGAGGTTCCCAGCCTGGTGGCCCCAGGACGTTCCGGTCGCATGGCAGAGTGCT 
ACGGACGACGCCTATGAAGCCCTTAGTCCTTCTAGTTGCGCTTTTGCTATGGCCTTCGTCTG 
TGCCGGCTTATCCGAGCATAACTGTGACACCTGATGAAGAGCAAAACTTGAATCATTATATA 
CAAGTTTTAGAGAACCTAGTACGAAGTGTTCCCTCTGGGGAGCCAGGTCGTGAGAAAAAATC 
TAACTCTCCAAAACATGTTTATTCTATAGCATCAAAGGGATCAAAATTTAAGGAGCTAGTTA 
CACATGGAGACGCTTCAACTGAGAATGATGTTTTAACCAATCCTATCAGTGAAGAAACTACA 
ACTTTCCCTACAGGAGGCTTCACACCGGAAATAGGAAAGAAAAAACACACGGAAAGTACCCC 
ATTCTGGTCGATCAAACCAAACAATGTTTCCATTGTTTTGCATGCAGAGGAACCTTATATTG 
AAAATGAAGAG CCAGAGC CAGAGC CGGAGCCAGCTGCAAAACAAACTGAGGC ACCAAGAATG 
TTGCCAGTTGTTACTGAATCATCTACAAGTCCATATGTTACCTCATACAAGTCACCTGTCAC 
CACTTTAGATAAGAGCACTGGCATTGAGATCTCTACAGAATCAGAAGATGTTCCTCAGCTCT 
CAGGTGAAACTGCGATAGAAAAACCCGAAGAGTTTGGAAAGCACCCAGAGAGTTGGAATAAT 
GATGACATTTTGAAAAAAATTTTAGATATTAATTCACAAGTGCAACAGGCACTTCTTAGTGA 
CACCAGCAACCCAGCATATAGAGAAGATATTGAAGCCTCTAAAGATCACCTAAAACGAAGCC 
TTGCTCTAGCAGCAGCAGCAGAACATAAATTAAAAACAATGTATAAGTCCCAGTTATTGCCA 
GTAGGACGAACAAGTAATAAAATTGATGACATCGAAACTGTTATTAACATGCTGTGTAATTC 
TAGATCTAAACTCTATGAATATTTAGATATTAAATGTGTTCCACCAGAGATGAGAGAAAAAG 
CTGCTACAGTATTCAATACATTAAAAAATATGTGTAGATCAAGGAGAGTCACAGCCTTATTA 
AAAGTTTATTAAACAATAATATAAAAATTTTAAACCTACTTGATATTCCATAACAAAGCTGA 
TTTAAGCAAACTGCATTTTTTCACAGGAGAAATAATCATATTCGTAATTTCAAAAGTTGTAT 
AAAAATATTTTCTATTGTAGTTCAAATGTGCCAACATCTTTATGTGTCATGTGTTATGAACA 
ATTTTCATATGCACTAAAAACCTAATTTAAAATAAAATTTTGGTTCAGGAAAAAA 
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FIGURE 146 

>< /usr/seqdb2 /ssb/DNA/Dnaseqs . min/ss . DNA6 8864 
xsubunit 1 of 1, 350 aa, 1 stop 
><MW: 39003, pi: 5.59, NX(S/T): 1 

MKPLVLLVALLLWPSSVPAYPSITVTPDEEQNLNHYIQVLENLVRSVPSGEPGREKKSNSPK 
HVYS I ASKGS KFKELVTHGDASTENDVLTNP I SEETTTFPTGGFTPE IGKKKHTE STPFWS I 
KPNNVS I VLHAE EPY I ENEE PE PE PE PAAKQTE APRML PWTES STS PYVTS YKS PVTTLDK 
STGIEISTESEDVPQLSGETAIEKPEEFGKHPESWNNDDILKKILDINSQVQQALLSDTSNP 
AYREDIEASKDHLKRSLALAAAAEHKLKTMYKSQLLPVGRTSNKIDDIETVINMLCNSRSKL 
YEYLD I KCVP PEMREKAATVFNTLKNMCRSRRVTALLKVY 

Signal peptide: 
amino acids 1-19 
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FIGURE 147 

CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTG 
ACCTCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACA 
GGAATATCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACA 
GTGGCAAGTCACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCT 
CCTGCTCCCTCTTTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAG 
TTCCATGCTGTGGTCCACCTCTACAGAGATGGGGAAGACTGGGAATCTAAGCAGATGCCACA 
GTATCGAGGGAGAACTGAGTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGC 
TAAAAAACATCACTCCCTCGGACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTAC 
GATGAGGAGGCCACCTGGGAGCTGCGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCAT 
CGTGGGATATGTTGACGGAGGTATCCAGTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGC 
CCACAGCCAAGTGGAAAGGTCCACAAGGACAGGATTTGTCTTCAGACTCCAGAGCAAATGCA 
GATGGGTACAGCCTGTATGATGTGGAGATCTCCATTATAGTCCAGGAAAATGCTGGGAGCAT 
ATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGAGGTGGAATCCAAGGTATTGATAGGAG 
AGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTATTTTACTCGGGTTACTCTGTGGT 
GCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTCAAATCCAAAGGGAAAATCCA 
GG CGGAACTGGACTGGAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCC CGGAAACACG 
CAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAA 
ACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGATTTACAAGGAA 
GAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGACGTGGGAC 
AAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGAGAGGGGGAAGAACAATGTG 
ACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTCAC 
ATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCC 
TGGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACC 
CTGCTGACATGTCAGTTTGAAGGCTTGTTGAGACCCTATATCCAGCATGCGATGTATGACGA 
GGAAAAGGGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTG 
CTTAAAGGGCCCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGC 
CCCAGCTTCCTCTCCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGC 
TGAGGTTCTTCTGCCCTGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGAT 
TGGC CTGACCCTGTGGGAGT CAGAAG CC ATGG CTGCCCTGAAGTGGGGACGGAATAGACTC A 
CATTAGGTTTAGTTTGTGAAAACTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCC 
CAGGCTCCTCATTTGCTAGTCACGGACAGTGATTCCTGCCTCACAGGTGAAGATTAAAGAGA 
CAACGAATGTGAATCATGCTTGCAGGTTTGAGGGCACAGTGTTTGCTAATGATGTGTTTTTA 
TATTATACATTTTCCCACCATAAACTCTGTTTGCTTATTCCACATTAATTTACTTTTCTCTA 
TACCAAATCACCCATGGAATAGTTATTGAACACCTGCTTTGTGAGGCTCAAAGAATAAAGAG 
GAGGTAGGATTTTTCACTGATTCTATAAGCCCAGCATTACCTGATACCAAAACCAGGCAAAG 
AAAACAG AAG AAG AGGAAGG AAAAC TAC AGGT C CAT AT C C CT C ATT AACACAG ACACAAAAA 
TTCTAAATAAAATTTTAACAAATTAAACTAAACAATATATTTAAAGATGATATATAACTACT 
CAGTGTGGTTTGTCCCACAAATGCAGAGTTGGTTTAATATTTAAATATCAACCAGTGTAATT 
CAGCACATTAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
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FIGURE 148 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA68866 
xsubunit 1 of l, 466 aa, l stop 
><MW: 52279, pi: 6.16, NX(S/T): 2 

MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHA 
WHLYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEE 
ATWELRVAALGSLPL I S I VG YVDGG I QLLCLS SGWFPQPTAKWKGPQGQDLS SDSRANADG Y 
SLYDVE I S 1 1 VQENAGS I LCS I HLAEQSHE VESKVLIGETFFQPSPWRLAS ILLGLLCGALC 
GVVMGMIIVFFKSKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTW 
HRKAPQEVPHSEKRFTRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLS 
PNWGYWVLRLTTEHLYFTFNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQ$LIYTLLT 
CQFEGLLRPYIQHAMYDEEKGTPIFICPVSWG 

Signal peptide: 
amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 



JSo 76 
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FIGURE 149 

CCTTCACAGGACTCTTCATTGCTGGTTGGCAATGATGTATCGGCCAGATGTGGTGAGGGCTA 

GGAAAAGAGTTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTC 

CTGGCAGTGTGCATTGGACTCACTGTTCATTATGTGAGATATAATCAAAAGAAGACCTACAA 

TTACTATAGCACATTGTCATTTACAACTGACAAACTATATGCTGAGTTTGGCAGAGAGGCTT 

CTAACAATTTTACAGAAATGAGCCAGAGACTTGAATCAATGGTGAAAAATGCATTTTATAAA 

T CTCCATTAAGGGAAGAATTTGTCAAGT CTCAGGTTATCAAGTTCAGTCAACAGAAGCATGG 

AGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGAAACTGTAG 

ATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTA 

GATCCTCACTCAGTTAAAATTAAAAAAATCAACAAGACAGAAACAGA'CAGCTATCTAAACCA 

TTGCTGCGGAACACGAAGAAGTAAAAGTCTAGGTCAGAGTCTCAGGATCGTTGGTGGGACAG 

AAGTAGAAGAGGGTGAATGGCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGT 

GGAGCAACCTTAATTAATGCCACATGGCTTGTGAGTGCTGCTCACTGTTTTACAACATATAA 

GAAC CCTG CC AGATGGACTGCTTCCTTTGGAGTAACAATAAAACCTTCGAAAATGAAACGGG 

GTCTCCGGAGAATAATTGTCCATGAAAAATACAAACACCCATCACATGACTATGATATTTCT 

CTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTtTCCCTGA 

TGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAA 

ATGATGGTTACAGTCAAAATCATCTTCGACAAGCACAGGTGACTCTCATAGACGCTACAACT 

TGCAATGAACCTCAAGCTTACAATGACGCCATAACTCCTAGAATGTTATGTGCTGGCTCCTT 

AGAAGGAAAAACAGATGCATGCCAGGGTGACTCTGGAGGACCACTGGTTAGTTCAGATGCTA 

GAGATATCTGGTACCTTGCTGGAATAGTGAGCTGGGGAGATGAATGTGCGAAACCCAACAAG 

CCTGGTGTTTATACTAGAGTTACGGCCTTGCGGGACTGGATTACTTCAAAAACTGGTATCTA 

AGAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTGGGTGTGGAGGCCATT 

TTTAGAGATACAGAATTGGAGAAGACTTGCAAAACAGCTAGATTTGACTGATCTCAATAAAC 

TGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTG 

CCAGATCAACTCTGTCATCTGTGAGCAATAGTTGAAACTTTATGTACATAGAGAAATAGATA 

ATACAATATTACATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGAC 

TTGTTGACATAAATTTGTAATGCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTC 

AGCTCCTCTCATTTCAGCAAATATCCATTTTCAAGGTGCAGAACAAGGAGTGAAAGAAAATA 

TAAGAAGAAAAAAATCCCCTACATTTTATTGGCACAGAAAAGTATTAGGTGTTTTTCTTAGT 

GGAATATTAGAAATGATCATATTCATTATGAAAGGTCAAGCAAAGACAGCAGAATACCAATC 

ACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAGATATA 

TCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTG 

ACCTATAATAATTATACAAACTTCATGCAATGTACTTGTTCTAAGCAAATTAAAGCAAATAT 

TTATTTAACATTGTTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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FIGURE 150 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA68871 
xsubunit 1 of 1, 423 aa, 1 stop 
><MW: 47696, pi: 8.96, NX(S/T): 3 
MNTCRPDVVRARKRVOT^ 

KLYAEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVLAHMLLICR 
FHSTEDPETVDKIVQLVLHEKiQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTL 
GQSLRIVGGTEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFG 
VTIKPSKMKRGLRRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDV 
MFVTGFGALKNDGYSQNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGD 
SGGPLVSSDARD IWYLAG I VSWGDECAKPNKPGVYTRVTALRDWI TSKTG I 

Transmembrane domain: 
amino acids 21-40 (type II) 
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FIGURE 151 

GTCGAAGGTTATAAAAGCTTCCAGCCAAACGGCATTGAAGTTGAAGATACAACCTGACAGCA 
CAGCCTGAGATCTTGGGGATCCCTCAGCCTAACACCCACAGACGTCAGCTGGTGGATTCCCG 
CTGCATCAAGGCCTACCCACTGTCTCCATGCTGGGCTCTCCCTGCCTTCTGTGGCTCCTGGC 
CGTGACCTTCTTGGTTCCCAGAGCTCAGCCCTTGGCCCCTCAAGACTTTGAAGAAGAGGAGG 
CAGATGAGACTGAGACGGCGTGGCCGCCTTTGCCGGCTGTCCCCTGCGACTACGACCACTGC 
CGACACCTGCAGGTGCCCTGCAAGGAGCTACAGAGGGTCGGGCCGGCGGCCTGCCTGTGCCC 
AGGACTCTCCAGCCCCGCCCAGCCGCCCGACCCGCCGCGCATGGGAGAAGTGCGCATTGCGG 
CCGAAGAGGGCCGCGCAGTGGTCCACTGGTGTGCCCCCTTCTCCCCGGTCCTCCACTACTGG 
CTGCTGCTTTGGGACGGCAGCGAGGCTGCGCAGAAGGGGCCCCCGCTGAACGCTACGGTCCG 
CAGAGCCGAACTGAAGGGGCTGAAGCCAGGGGGCATTTATGTCGTTTGCGTAGTGGCCGCTA 
ACGAGGCCGGGGCAAGCCGCGTGCCCCAGGCTGGAGGAGAGGGCCTCGAGGGGGCCGACATC 
CCTGCCTTCGGGCCTTGCAGCCGCCTTGCGGTGCCGCCCAACCCCCGCACTCTGGTCCACGC 
GGCCGTCGGGGTGGGCACGGCCCTGGCCCTGCTAAGCTGTGCCGCCCTGGTGTGGCACTTCT 
GCCTGCGCGATCGCTGGGGCTGCCCGCGCCGAGCCGCCGCCCGAGCCGCAGGGGCGCTCTGA 
AAGGGGCCTGGGGGCATCTCGGGCACAGACAGCCCCACCTGGGGCGCTCAGCCTGGCCCCCG 
GGAAAGAGGAAAACCCGCTGCCTCCAGGGAGGGCTGGACGGCGAGCTGGGAGCCAGCCCCAG 
GCTCCAGGGCCACGGCGGAGTCATGGTTCTCAGGACTGAGCGCTTGTTTAGGTCCGGTACTT 
GGCGCTTTGTTTCCTGGCTGAGGTCTGGGAAGGAATAGAAAGGGGCCCCCAATTTTTTTTTA 
AGCGGCCAGATAATAAATAATGTAACCTTTGCGGTTAAAAAAAAAAAAAAAAAA 
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FIGURE 152 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA68874 
xsubunit 1 of 1, 238 aa, 1 stop 
><MW: 25262, pi: 6.44, NX(S/T): 1 

MLGSPCLLWLLAVTFLVPRAQPLAPQDFEEEEADETETAWPPLPAVPCDYDHCRHLQVPCKE 
LQRVGPAACLCPGLSSPAQPPDPPRMGEVRIAAEEGRAVVHWCAPFSPVLHYWLLLWDGSEA 
AQKGP PtiNATVRRAELKGLK PGG I YWCWAANE AGASRVPQAGGEGLEGAD I PAFGP CSRL 
AVPPNPRTLVHAAVGVGTALALLSCAALVWHFCLRDRWGCPRRAAARAAGAL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 194-220 

N-glycosylation site, 

amino acids 132-135 
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FIGURE 153 

AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCC 

CTGCCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGG 

CCCAGCGCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGC 

TTCTCCCTTACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCT 

CTACGCCCTCAATCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGA 

TGAGGGACTACCTAAATAATGTTCTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGTC 

ATTTTGACTTACTTTCCTGTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCAT 

TGTGGGGATGTTAGGATATTGTGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACT 

TTGGAAGTTTGCTTGTCATTTTCTGTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAG 

GAACTTATGGTTCCAGTACAATGGTCAGATATGGTCACTTTGAAAGCCAGGATGACAT^ATTA 

TGGATTACCTAGATATCGGTGGCTTACTCATGCTTGGAATTTTTTTCAGAGAGAGTTTAAGT 

GCTGTGGAGTAGTATATTTCACTGACTGGTTGGAAATGACAGAGATGGACTGGC CC CCAGAT 

TCCTGCTGTGTTAGAGAATTC C CAGGATGTTCCAAACAGGCCCACCAGGAAGAT CT CAGTGA 

CCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTTTTTGAGAGGAACCAAACAACTGC 

AGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAATCCTGGCCATGAtTCTCACC 

ATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGGACAGACCAAATGATGTC 

CTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACTGTTGAAACCAAGCC 

TGTCAAGAATCTTTGAACACACATCCATGGCAAACAGCTTTAATACACACTTTGAGATGGAG 

GAGTTATAAAAAGAAATGTCACAGAAGAAAACCACAAACTTGTTTTATTGGACTTGTGAATT 

TTTGAGTACATACTATGTGTTTCAGAAATATGTAGAAATAAAAATGTTGCCATAAAATAACA 

CCTAAGCATATACTATTCTATGCTTTAAAATGAGGATGGAAAAGTTTCATGTCATAAGTCAC 

CACCTGGACAATAATTGATGCCCTTAAAATGCTGAAGACAGATGTCATACCCACTGTGTAGC 

CTGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTC 

CGCATCCATGCAAACGAGTCACATATGGTGGGACTGGAGCCATAGTAAAGGTTGATTTACTT 

CTACCAACTAGTATATAAAGTACTAATTAAATGCTAACATAGGAAGTTAGAAAATACTAATA 

ACTTTTATTACTCAGCGATCTATTCTTCTGATGCTAAATAAATTATATATCAGAAAACTTTC 

AATATTGGTGACTACCTAAATGTGATTTTTGCTGGTTACTAAAATATTCTTACCACTTAAAA 

GAGCAAGCTAACACATTGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAA 

TCTGTATAATTCAGTCGATTTCAGTTCTGATAATGTTAAGAATAACCATTATGAAAAGGAAA 

ATTTGTCCTGTATAGCATCATTATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGT 

CCTGGGCTTATATTACACATATAACTGTTATTTAAATACTTAACCACTAATTTTGAAAATTA 

CCAGTGTGATACATAGGAATCATTATTCAGAATGTAGTCTGGTCTTTAGGAAGTATTAATAA 

GAAAATTTGCACATAACTTAGTTGATTCAGAAAGGACTTGTATGCTGTTTTTCTCCCAAATG 

AAGACTCTTTTTGACACTAAACACTTTTTAAAAAGCTTATCTTTGCCTTCTCCAAACAAGAA 

GCAATAGTCTCCAAGTCAATATAAATTCTACAGAAAATAGTGTTCTTTTTCTCCAGAAAAAT 

GCTTGTGAGAATCATTAAAACATGTGACAATTTAGAGATTCTTTGTTTTATTTCACTGATTA 

ATATACTGTGGCAAATTACACAGATTATTAAATTTTTTTACAAGAGTATAGTATATTTATTT 

GAAATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTATTTCTCAGAATATGGAA 

AGAAAATTAAAATGTGTCAATAAATATTTTCTAGAGAGTAA 
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FIGURE 154 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA68880 
xsubunit 1 of 1, 305 aa, 1 stop 
><MW: 35383, pi: 5.99, NX(S/T): 0 

MAREDSVKCLRCLLYALNLLFWLMS I SVLAVS AWMRDYLNNVLTLTAETRVEEAVI LTYFPV 
VHP VM I AVCC FL 1 1 VGMLG YCGTVKRNLLLLAWY FG SLLV I F CVELACGVWT YEQELMVPVQ 
WSDMVTLKARMTOTGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCW 
PGCSKQAHQEDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWAL 
YYDRREPGTDQ^4MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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FTGURE 155 

GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCC 

TGCACTCGGGCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGA 

CCTGTGTGGGGAGGCCCTCCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAG 

ACCGGGAGGATCACAGAGCCAGCATGTTACAGGATCCTGACAGTGATCAACCTCTGAACAGC 

CTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCCATGGAGACCTTCAGAAAGGTGGGGAT 

CCCCATCATCATAGCACtACTGAGCCTGGCGAGTATCATCATTGTGGTTGTCCTCATCAAGG 

TGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGAGGAAGCAG 

CTGTGTGACGGAGAGCTGGACTGTCCCTTGGGGGAGGACGAGGAGCACTGTGTCAAGAGCTT 

CCCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGG 

ACTCGGCCACAGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAG 

ACAGCCTGTAGGCAGATGGGCTACAGCAGAGCTGTGGAGATTGGCCCAGACCAGGATCTGGA 

TGTTGTTGAAATCACAGAAAACAGCCAGGAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTC 

TCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTGTGGGAAGAGCCTGAAGACCCCCCGT 

GTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGGTCAGCATCCAGTACGA 

CAAACAGCACGTCTGTGGAGGGAGCATCCTGGACCCCCACTGGGTCCTCACGGCAGCCCACT 

GCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGC 

AGCTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAA 

AGACAATGACATCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGC 

CCATCTGTCTGCCCTTCTTTGATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGA 

TGGGGCTTTACGAAGCAGAATGGAGGGAAGATGTCTGACATACTGCTGCAGGCGTCAGTCCA 

GGTCATTGACAGCACACGGTGCAATGCAGACGATGCGTACCAGGGGGAAGTCACCGAGAAGA 

TGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGTGACAGTGGTGGGCCC 

CTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGGCTGCGG 

GGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATG 

TCTGGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTG 

CCCTGCCCACCTGGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCC 

CTCTGCCCACAGCCTCAGCATTTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCC 

TCGCAGCCCAGAGGCGCCCAGAGGAAGTCAGCAGCCCTAGCTCGGCCACACTTGGTGCTCCC 

AGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCTCAGGGGTATTGCTAAGCCAAGAA 

GGAACTTTCCCACACTACTGAATGGAAGCAGGCTGTCTTGTAAAAGCCCAGATCACTGTGGG 

CTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAAGCCTA 

CTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTT 

ACCTACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCT 

CTGGCAAAAAAAAAAAA 
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FIGURE 156 

x/usr/ seqdb2/ sst/DNA/Dnaseqs .min/ss .DNA68885 
xsubunit 1 of 1, 432 aa, 1 stop 
><MW: 47644, pi: 5.18, NX(S/T): 2 

MLQDPDSDQPLNSLDVKPLRKPRI PMETFRKVGI PI I IALLSLAS 1 1 1 VWLIKVILDKYYF 
LCGQPLHF I PRKQLCDGELDCPLGEDEEHCVKS F PEGP AVAVRLSKDRSTLQVLDS ATGNWF 
SACFDNFTEALAETACRQMGYSRAVE IGPDQDLD WE I TENSQELRMRNSSGPCLSGSLVSL 
HCLACGKSLKTPRWGGEEASVDSWPWQVS IQYDKQHVCGGS I LDPHWVLTAAHCFRKHTDV 
FNWKVRAGSDKLGSFPSLAVAKI 1 1 IEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFD 
EELTPATPLWIIGWGFTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPE 
GGVDTCQGDSGGPLMYQSDQWHVVGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE 157 

GGGCTGAGGCACTGAGAGACCGGAAAGCCTGGCATTCCAGAGGGAGGGAAACGCAGCGGCATCCCCAGGCTCCAG 
AGCTCCCTGGTGACAGTCTGTGGCTGAGCAT^CCCTCCCAGCCCTGGGCCTGGACCCCTGGAGCCTCCTGGGCC 
TTTTCCTCTTCCAACTGCTTCAGCTGCTGCTGCCGACGACGACCGCGGGGGGAGGCGGGCAGGGGCCCATGCCCA 
GGGTCAGATACTATGCAGGGGATGAACGTAGGGCACTTAGCTTCTTCCACCAGAAGGGCCTCCAGGATTTTGACA 
CTCTGCTCCTGAGTGGTGATGGAAATACTCTCTACGTGGGGGCTCGAGAAGCCATTCTGGCCTTGGATATCCAGG 
ATCCAGGGGTCCCCAGGCTAAAGAACATGATACCGTGGCCAGCCAGTGACAGAAAAAAGAGTGAATGTGCCTTTA 
AGAAGAAGAGCAATGAGACACAGTGTTTCAACTTCATCCGTGTCCTGGTTTCT^ 

CCTGCGGCACCTTCGCCTTCAGCCCTGCTTGTACCTTC^TTGAACTTCAAGATTCCTACCTGTTGCCCATrrCGG 

AGG ACAAGGTC ATGGAGGG AAAAGGC C AAAGCCC CTTTGACCC CGCTCACAAGC AT ACGG CTGTCTTGGTGGATG 

GGATGCTCTTATTCTGGTACTATGAACAACTTCCTGGGCAGTGAGCCCATCCTGATGCGCACACTGGGATCCCAGC 

CTGTCCTCAAGACCGACAACTTCCTCCGCTGGCTGCATCATGACGCCTCCTTTGTGGCAGCCATCCCTTCGACCC 

AGGTCGTCTACTTCTTCTTCGAGGAGACAGCCAGCGAGTTTGACTTCTTTGAGAGGCTCCACACATC 

CTAGAGTCTGCAAGAATGACGTGGGCGGCGAAAAGCTGCTGCAGAAGAAGTGGACCACCTTCCTGAAGGCCCAGC 

TGCTCTGCACCCAGCCGGGGCAGCTGCCCTTCAACGTCATCCGCCACGCGGTCCTGCTCCCCGCCGATTCTCCCA 

CAGCTCCCCACATCTACGCAGTCTTCACCTCCCAGTGGCAGGTTGGCGGGACCAGGAGCTCTGCGGTTTGTGCCT 

TCTCTCTCTTGGACATTGAACGTGTCTTTAAGGGGAAATACAAAGAGTTGAACAA^ 

CTTATAGGGGCCCTGAGACCAACCCCCGGCCAGGCAGTTGCTCAGTGGGCCCCTCCTCTGATAAGGCCCTGACCT 
TCATGAAGGACCATTTCCTGATGGATGAGCAAGTGGTGGGGACGCCCCTGCTGGTGAAATCTGGCGTGGAGTATA 
CACGGCTTGCAGTGGAGACAGCCCAGGGCCTTGATGGGCACAGCCATCTTGTCATGTACCTGGGAACCACCACAG 
GGTCGCTC C ACAAGGCTGTGGTAAGTGGGGACAGCAGTG CTC AT CTGGTGG AAGAGATTCAGCTGTTCCCTGACC 
CTGAACCTGTTCGCAACCTGCAGCTGGCCCCCACCCAGGGTGCAGTGTTTGTAGGCTTCTCAGGAGGTGTCTGGA 
GGGTGCCCCGAGCCAACTGTAGTGTCTATGAGAGCTGTGTGGACTGTGTCCTTGCCCGGGACCCCCACTGTGCCT 
GGGACCCTGAGTCCCGAACCTGTTGCCTCCTGTCTGCCCCCAACCTGAACTCCTGGAAGCAGGACATGGAGCGGG 
GGAACCCAGAGTGGGCATGTGCCAGTGGCCCCATGAGCAGGAGCCTTCGGCCTCAGAGCCGCCCGCAAATCATTA 
AAGAAGTCCTGGCTGTCCCCAACTCCATCCTGGAGCTCCCCTGCCCCCACCTGTCAGCCTTGGCCTCTTATTATT 
GGAGTCATGGCCCAG CAGC AGTCCCAGAAGCCTCTTC CACTGTCTACAATGGCT CCCTCTTGCTGATAGTGCAGG 
ATGGAGTTGGGGGTCTCTACCAGTGCTGGGCAACTGAGAATGGCTTTTCATACCCTGTGATCTCCTACTGGGTGG 
ACAGCCAGGACCAGACCCTGGCCCTGGATCCTGAACTGGCAGGCATCCCCCGGGAGCATGTGAAGGTCCCGTTGA 
CC^GGGTCAGTGGTGGGGCCGCCCTGGCTGCCCAGCAGTCCTACTGGCCCCACTTTGTCACTGTCACTGTCCTCT 
TTGCCTTAGTGCTTTCAGGAGCCCTCATCATCCTCGTGGCCTCCCCATTGAGAGCACTCCGGGCTCGGGGCAAGG 
TTCAGGGCTGTGAGACCCTGCGCCCTGGGGAGAAGGCCCCGTTAAGC^GAGAGCAACACCTCCAGTCTCCCAAGG 
AATGCAGGACCTCTGCC^GTGATGTGGACGCTGACAACAACTGCCTAGGCACTGAGGTAGCTTAAACTCTAGGCA 
CAGGCCGGGGCTGCGGTGCAGGCACCTGGCCATGCTGGCTGGGCGGCCCAAGCACAGCCCTGACTAGGATGACAG 
CAGCACAAAAGACCACCTTTCTCCCCTGAGAG<3AGCTTCTGCTACTCTGCATCACTGATGACACTCAGCAGGGTG 
ATGCACAGCAGTCTGCCTCCCCTATGGGACTCCCTTCTACCAAGCACATGAGCTCTCTAACAGGGTGGGGGCTAC 
CCCCAGACCTG CTCC TACACT^^TATTGAAGAACCTGG AG AGGATCCTTCAGTTCTGGCCATTCCAGGG ACCCTC 
CAGAAACACAGTGTTTCAAGAGACCCTAAAAAACCTGCCTGTCCCAGGACCCTATGGTAATGAACACCAAACATC 
TAAACAATCATATGCTAACATGCCACTCCTGGAAACTCC^ 

TCTCCCAGGGTCATGCAGGGATCTGCTCCCTCOTGCTTCCCTTACCAGTCGTGCACCGCTGACTCCCAGGAAGTC 
TTTCCTGAAGTCTGACCACCTTTCTTCTTGCTTCAGTTGGGGCAGACTCTGATCCCTTCTGCCCTO 
CAGGGGTAATCTGAGCCTTCTTCACTCCTTTACCCTAGCTGACCCCTTCACCTCTCCCCCT 
TTTGGGATTCAGAAAACTGCTTGTCAGAGACTGTTTATTTTTTATTAAAAATATAAGGCTTAAAAAAA 
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FIGURE 158 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA71166 
xsubunit 1 of 1, 761 aa, 1 stop 
><MW: 83574, pi: 6.78, NX (S/T) : 4 

MALPALGLDPWSLLGLFLFQLLQLLLPTTTAGGGGQGPMPRVRYYAGDERRALSFFHQKGLQ 
DFDTLLLSGDGNTLYVGAREAILALDIQDPGVPRLKNMIPWPASDRKKSECAFKKKSNETQC 
FNF I RVLVS YNVTHLYTCGTFAFS PACTF I ELQDS YLL P I SEDKVMEGKGQS P FDPAHKHTA 
VLVDGMLYSGTMmFLGSEPILMRTLGSQPVLKTDNFLRWLHHDASFVAAIPSTQVVYFFFE 
ETASEFDFFERLHTSRVARVCKNDVGGEKLLQKKWTTFLKAQLLCTQPGQLPFNVIRHAVLL 
PADSPTAPHIYAVFTSQWQVGGTRSSAVCAFSLLDIERVFKGKYKELNKETSRWTTYRGPET 
NPRPGSCSVGPSSDKALTFMKDHFLMDEQWGTPLLVKSGVEYTRLAVETAQGLPGHSHLVM 
YLGTTTGSLHKAWSGDSSAHLVEEIQLFPDPEPVRNLQLAPTQGAVFVGFSGGVWRVPRAN 
CSVYESC^O^ARDPHCAWDPESRTCCLLSAPNLNSWKQDMERGNPEWACASGPMSRSLRP 
QSRPQIIKEVLAVPNSILELPCPHLSALASYYWSHGPAAVPEASSTVYNGSLLLIVQDGVGG 
LYQCWATENGFSYPVISYWVDSQDQTLALDPELAGIPREHVKVPLTRVSGGAALAAQQSYWP 
HFVTVTVLFALVLSGALIILVASPLRALRARGKVQGCETLRPGEKAPLSREQHLQSPKECRT 
S AS DVDADNNCLGTE VA 

Signal peptide: 

amino acids 1-30 

Transmembrane domains : 

amino acids 136-156, 222-247, 474-490, 685-704 
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FTOTTRE 159 

AGGGTCCCTTAGCCGGGCGCAGGGCGCGCAGCCCAGGCTGAGATCCGCGGCTTCCGTAGAAG 

TGAGC ATG GCTGGGCAGCGAGTGCTTCTTCTAGTGGGCTTCCTTCTCCCTGGGGTCCTGCTC 

TC AGAGGCTGCCAAAATCCTGACAATAT CTAC AGTAGGTGGAAGC CATTATCTACTGATGGA 

CCGGGTTTCTCAGATTCTTCAAGATCACGGTCATAATGTCACCATGCTTAACCACAAAAGAG 

GTCCTTTTATGCCAGATTTTAAAAAGGAAGAAAAATCATATCAAGTTATCAGTTGGCTTGCA 

CCTGAAGATCATCAAAGAGAATTTAAAAAGAGTTTTGATTTCTTTCTGGAAGAAACTTTAGG 

TGGCAGAGGAAAATTTGAAAACTTATTAAATGTTCTAGAATACTTGGCGTTGCAGTGCAGTC 

ATTTTTTAAATAGAAAGGATATCATGGATTCCTTAAAGAATGAGAACTTCGACATGGTGATA 

GTTGAAACTTTTGACTACTGTCCTTTCCTGATTGCTGAGAAGCTTGGGAAGCCATTTGTGGC 

CATTCTTTCCACTTCATTCGGCTCTTTGGAATTTGGGCTACCAATCCCCTTGTCTTATGTTC 

CAGTATTCCGTTCCTTGCTGACTGATCACATGGACTTCTGGGGCCGAGTGAAGAATTTTCTG 

ATGTTCTTTAGTTTCTGCAGGAGGCAACAGCACATGCAGTCTACATTTGACAACACCATCAA 

GGAACATTTCACAGAAGGCTCTAGGCCAGTTTTGTCTCATCTTCTACTGAAAGCAGAGTTGT 

GGTTCATTAACTCTGACTTTGCCTTTGATTTTGCTCGACCTCTGCTTCCCAACACTGTTTAT 

GTTGGAGGCTTGATGGAAAAACCTATTAAACCAGTACCACAAGACTTGGAGAAClTCATTGC 

CAAGTTTGGGGACTCTGGTTTTGTCCTTGTGACCTTGGGCTCCATGGTGAACACCTGTCAGA 

ATCCGGAAATCTTCAAGGAGATGAACAATGCCTTTGCTCACCTACCCCAAGGGGTGATATGG 

AAGTGTCAGTGTTCTCATTGGCCCAAAGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGA 

CTGGCTTCCTCAGAGTGACCTCCTGGCTCACCCAAGCATCCGTCTGTTTGTCACCCACGGCG 

GGCAGAATAGCATAATGGAGGC CATCCAGCATGGTGTG CCCATGGTGGGGATCCCT CTCTTT 

GGAGACCAGCCTGAAAACATGGTC CGAGTAGAAGCCAAAAAGTTTGGTGTTTCTATT CAGTT 

AAAGAAGCTCAAGGCAGAGACATTGGCTCTTAAGATGAAACAAATCATGGAAGACAAGAGAT 

ACAAGTCCGCGGCAGTGGCTGCCAGTGTCATCCTGCGCTCCCACCCGCTCAGCCCCACACAG 

CGGCTGGTGGGCTGGATTGACCACGTCCTCCAGACAGGGGGCGCGACGCACCTCAAGCCCTA 

TGTCTTTCAGCAGCCCTGGCATGAGCAGTACCTGTTCGACGTTTTTGTGTTTCTGCTGGGGC 

TCACTCTGGGGACTCTATGGCTTTGTGGGAAGCTGCTGGGCATGGCTGTCTGGTGGCTGCGT 

GGGGCCAGAAAGGTGAAGGAGACATAAGGCCAGGTGCAGCCTTGGCGGGGTCTGTTTGGTGG 

GCGATGTCACCATTTCTAGGGAGCTTCCCACTAGTTCTGGCAGCCCCATTCTCTAGTCCTTC 

TAGTTATCTCCTGTTTTCTTGAAGAACAGGAAAAATGGCCAAAAATCATCCTTTCCACTTGC 

TAATTTTGCTACAAATTCATCCTTACTAGCTCCTGCCTGCTAGCAGAAATCTTTCCAGTCCT 

CTTGTCCTCCTTTGTTTGCCATCAGCAAGGGCTATGCTGTGATTCTGTCTCTGAGTGACTTG 

GACCACTGACCCTCAGATTTCCAGCCTTAAAATCCACCTTCCTTCTCATGCGCCTCTCCGAA 

TCACACCCTGACTCTTCCAGCCTCCATGTCCAGACCTAGTCAGCCTCTCTCACTCCTGCCCC 

TACTATCTATCATGGAATAACATCCAAGAAAGACACCTTGCATATTCTTTCAGTTTCTGTTT 

TGTTCTCCCACATATTCTCTTCAATGCTCAGGAAGCCTGCCCTGTGCTTGAGAGTTCAGGGC 

CGGACACAGGCTCACAGGTCTCCACATTGGGTCCCTGTCTCTGGTGCCCACAGTGAGCTCCT 

TCTTGGCTGAGCAGGCATGGAGACTGTAGGTTTCCAGATTTCCTGAAAAATAAAAGTTTACA 

GCGTTATCTCTCCCCAACCTCACTAA 
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FIGURE 160 

>< /usr/seqdb2 /sst /DNA/Dnaseqs . min/ss . DNA711 6 9 
xsubunit 1 of 1, 523 aa, 1 stop 
><MW: 59581, pi: 8.68, NX(S/T): 1 

magqrvlllvgfllpgvllseaakiltistvggshyllmdrvsqilqdhghnvtml^krgp 
fmpdfkkeeksyqviswlapedhqrefkksfd^ 

lnrkdimdslknenfdmvivetfdycpfliaeklgkpfvailstsfgslefglpiplsyvpv 
frslltdhmdfwgrvknflmffsfcrrqqhmqstfdntikehftegsrpvlshlllkaelwf 
insdfafdfarpllpntvyvgglmekpikpvpqdlenfiakfgdsgfvlvtlgsmvntcqnp 
eifkeml^afahlpqgviwkcqcshwpkdvhlaanvkivdwlpqsdllahpsirlfvthggq 
nsimeaiqhgvpmvgiplfgdqpenmtoveakkfgvsiqlkklkaetlalkmkq;medkryk 
saavaasvilrshplsptqrlvgwidhvlqtggathlkpyvfqqpwheqylfdvfvfllglt 

LGTLWLCGKLLGMAVWWLRGARKVKET 

Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 483-504 
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FTQTTRE 161 

GGGCTGTTGATTTGTGGGGGATTTTGAAGAGAGGAGGAATAGGAGGAAGGGGTTGAGGGGCT 
GCCTCTGGCATATGCACACACTCACACATT CTGTCACACC CGTCACACACACATAC CATGTT 
CTCCATCCCCCCAGGTCCAGCCCTCAGTGCTGTCCCATCCAGCAGGGCTACCCTGAAGCTCT 
GGCTGCAGCCCTCCCGTCCAGTGGGCAGGCGGCTTCATCCCTCCTTTCTCTCCCAAAGCCCA 
ACTGCTGTCACTGCATGCTCTGCCAAGGAGGAGGGAACTGCAGTGACAGCAGGAGTAAGAGT 
GGGAGGCAGGACAGAGCTGGGACACAGGTATGGAGAGGGGGTTCAGCGAGCCTAGAGAGGGC 
AGACTAT CAGGGTGCCGG CGGTGAGAATCCAGGGAGAGGAGCGGAAACAGAAGAGGGGCAGA 
AGACCGGGGCACTTGTGGGTTGCAGAGCCCCTCAGCCATGTTGGGAGCCAAGCCACACTGGC 
TACCAGGTCCCCTACACAGTCCCGGGCTGCCCTTGGTTCTGGTGCTTCTGGCCCTGGGGGCC 
GGGTGGGCCCAGGAGGGGTCAGAGCCCGTCCTGCTGGAGGGGGAGTGCCTGGTGGTCTGTGA 
GCCTGGCCGAGCTGCTGCAGGGGGGCCCGGGGGAGCAGCCCTGGGAGAGGCACCCCCTGGGC 
GAGTGGCATTTGCTGCGGTCCGAAGCCACCACCATGAGCCAGCAGGGGAAACCGGCAATGGC 
ACCAGTGGGGCCATCTACTTCGACCAGGTCCTGGTGAACGAGGGCGGTGGCTTTGACCGGGC 
CTCTGGCTCCTTCGTAGCCCCTGTCCGGGGTGTCTACAGCTTCCGGTTCCATGTGGTGAAGG 
TGTACAACCGCCAAACTGTCCAGGTGAGCCTGATGCTGAACACGTGGCCTGTCAtCTCAGCC 
TTTGCCAATGATCCTGACGTGACCCGGGAGGCAGCCACCAGCTCTGTGCTACTGCCCTTGGA 
CCCTGGGGACCGAGTGTCTCTGCGCCTGCGTCGGGGGAATCTACTGGGTGGTTGGAAATACT 
CAAGTTTCTCTGGCTTCCTCATCTTCCCTCTCTGAGGACCCAAGTCTTTCAAGCACAAGAAT 
CCAGCCCCTGACAACTTTCTTCTGCCCTCTCTTGCCCCAGAAACAGCAGAGGCAGGAGAGAG 
ACTCCCTCTGGCTCCTATCCCACCTCTTTGCATGGGACCCTGTGCCAAACACCCAAGTTTAA 
GAGAAGAGTAGAGCTGTGGCATCTCCAGACCAGGCCTTTCCACCCACCCACCCCCAGTTACC 
CTCCCAGCCACCTGCTGCATCTGTTCCTGCCTGCAGCCCTAGGATCAGGGCAAGGTTTGGCA 
AGAAGGAAGATCTGCACTACTTTGCGGCCTCTGCTCCTCCGGTTCCCCCACCCCAGCTTCCT 
GCTGAATGCTGATCAGGGACAGGTGGCGCAGGTGAGCCTGACAGGCCCCCACAGGAGCCCAG 
ATGGACAAGCCTCAGCGTACCCTGCAGGCTTCTTCCTGTGAGGAAAGCCAGCATCACGGATC 
TCAGCCAGCACCGTCAGAAGCTGAGCCAGCACCGTATGGGCTAGGGTGGGAGGCTCAGCCAC 
AGGCAGAAGGGTGGGAAGGGCCTGGAGTCTGTGGCTGGTGAGGAAGGAAGGAGGGTGTATTG 
TCTAGACTGAACATGGTACACATTCTGCATGTATAG CAGAG CAGCCAGCAGGTAGCAATC CT 
GGCTGTCCTTCTATGCTGGATCCCAGATGGACTCTGGCCCTTACCTCCCCACCTGAGATTAG 
GGTGAGTGTGTTTGCTCTGGCTGAGAGCAGAGCTGAGAGCAGGTATACAGAGCTGGAAGTGG 
ACCATGGAAAACATCGATAACCATGCATCCTCTTGCTTGGCCACCTCCTGAAACTGCTCCAC 
CTTTGAAGTTTGAACTTTAGTCCCTCCACACTCTGACTGCTGCCTCCTTCCTCCCAGCTCTC 
TCACTGAGTTATCTTCACTGTACCTGTTCCAGCATATCCCCACTATCTCTCTTTCTCCTGAT 
CTGTGCTGTCTTATTCTCCTCCTTAGGCTTCCTATTACCTGGGATTCCATGATTCATTCCTT 
CAGACCCTCTCCTGCCAGTATGCTAAACCCTCCCTCTCTCTTTCTTATCCCGCTGTCCCATT 
GGCCCAGCCTGGATGAATCTATCAATAAAACAACTAGAGAATGGTGGTCAGTGAGACACTAT 
AGAATTACTAAGGAGAAGATGCCTCTGGAGTTTGGATCGGGTGTTACAGGTACAAGTAGGTA 
TGTTGCAGAGGAAAATAAATATCAAACTGTATACTAAAATTAAAAA 



/^2 7^ 
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FIGURE 162 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA71180 
xsubunit 1 of 1, 205 aa, 1 stop 
><MW: 21521, pi: 7.07, NX(S/T): 1 

MLGAKPHWLPGPLHSPGLPLVLVLLALGAGWAQEGSEPVLLEGECLVVCEPGRAAAGGPGGA 
ALGEAPPGRVAFAAVRSHHHEPAGETGNGTSGAIYFDQVLVNEGGGFDRASGSFVAPVRGVY 
SFRFHWKVYNRQTVQVSLMLNTWPVISAFANDPDVTREAATSSVLLPLDPGDRVSLRLRRG 
NLLGGWKY S S FSGFL I F PL 

Signal peptide: 

amino acids 1-32 
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FIGURE 164 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA71184 
xsubunit 1 of 1, 3 88 aa, 1 stop 
><MW: 43831, pi: 9.64, NX(S/T): 3 

MKTLIAAYSGVLRGERQAEADRSQRSHGGPALSREGSGRWGTGSSILSALQDLFSVTWLNRS 
KVEKQLQVISVLQWVLSFLVLGVACSAILMYIFCTDCWLIAVLYFTWLVFDWNTPKKGGRRS 
QWVRNWAVWRYFRDYFPIQLVKTHNLLTTRNYIFGYHPHGIMGLGAFCNFSTEATEVSKKFP 

G I RP YLATLAGNFRMP VLRE YLMSGG I CPVSRDT I D YLLSKNGSGNAI I I WGGAAE S LS SM 
PGKNAVTLRNRKGFVKLALRHGADLVPIYSFGENEVYKQVIFEEGSWGRWVQKKFQKYIGFA 
PCIFHGRGLFSSDTWGLVPYSKPITTWGEPITIPKLEHPTQQDIDLYHTMYMEALVKLFDK 

HKTKFGLPETEVLEVN 

Important features of the protein: 
Transmembrane domain: 

amino acids 76-97 

N-glycosylation sites. 

amino acids 60-63, 173-176, 228-231 

N-myristoylation sites. 

amino acids 10-15, 41-46, 84-89, 120-125, 169-174, 229-234, 240- 
245, 318-323, 378-383 
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FIGURE 165 

GGGCGGCGGGATGGGGGCCGGGGGCGGCGGGCGCCGCACTCGCTGAGGCCCCGACGCAGGGCCGGGCCGGGCCCA 

GGGCCGAGGAGCGCGGCGGCCAGAGCGGGGCCGCGGAGGCGACGCCGGGGACGCCCGCGCGACGAGCAGGTGGCG 

GCGGCTGCAGGCTTGTCCAGCCGGAAGCCCTGAGGGCAGCTGTTCCCACTGGCTCTGCTGACCTTGTGCCTTGGA 

CGGCTGTCCTC^GCGAGGGGCCGTGCACCCGCTCCTGAGCAGCGC CATGG GCCTGCTGGCCTTCCTGAAGACC 

GTTCGTGCTGCACCTGCTGGTCGGCTTTGTCTTCGTGGTGAGTGGTCTGGTCATCAACTTCGTCCAGCTGTGCAC 

GCTGGCGCTCTGGCCGGTCAGCAAGCAGCTCTACCGCCGCCTCAACTGCCGCCTCGCCTACTCACTCTGGAGCCA 

AC TGGTCATG CTGCTGG AGTGGTGGTCCTGCACGGAGTGTAC ACTGTTCACG^^ 

TGGGAAGGAGCACGCAGTCATCATCCTCAACCACAACTrCGAGATCGACTTCCTCTGTGGGTGGACCA 

GCGCTTCGGAGTGCTGGGGAGCTCCAAGGTCCTCGCTAAGAAGGAGCTGCTCTACGTGCCCCTCATCGGCTGGAC 

GTGGTACrTTCTGGAGATTGTGTTCTGCAAGCGGAAGTGG<3AGGAGGACCGGGACACCGTGGTCGAAGGGCrrGAG 

GCGCCTGTCGGACTACCCCGAGTACATGTGGTTTCTCCTGTACTGCGAGGGGACGCGCTTCACGGAGACCAAGCA 

CCGCGTTAGCATGGAGGTGGCGGCTGCTAAGGGGCTTCCTGTCCTCAAGTACCACCTGCTGCCGCGGACCAAGGG 

CTTCACCACCGCAGTCAAGTGCCTCCGGGGGACAGTCGCAGCTGTCTATGATGTAACCCTGAACTTCAGAGGAAA 

CAAGAACCCGTCCCTGCTGGGGATCCTCTACGGGAAGAAGTACGAGGCGGACATGTGCGTGAGGAGATTTCCTCT 

GGAAGACATCCCG CTGGATGAAAAGGAAGCAGCTCAGTGGCTTCATAAACTGTACCAGGAG AAGGACG CGCTCCA 

GGAGATATATAATCAGAAGGGCATGTTTCCAGGGGAGCAGTTTAAGCCTGCCCGGAGGCCGTGGACCCTCCTGAA 

CTTCCTGTCCTGGGCCACCATTCTCCTGTCTCCCCTCTTCAGTTTTGTCTTGGGCGTCTTTGCCAG^CGGATCACC 

TCTCCTGATCCTGACTTTCTTGGGGTTTGTGGGAGCAGCTTCCTTTGGAGTTCGCAGACTGATAGGAGAATCGCT 

TGAACCTGGGAGGTGGAGATTGCAGTGAGCTGAGATGGCATCACTGTACTCCAGCCTAGGCAACAGAGCAAGACT 

CAGTCTCAAAAAAAAAAAAAAACAAAAAAACCCCAGAAATTCTGGAGTTGAACTGTGTAGTTACTGACATGAAAA 

ATTCACTAGAGGCTGAACAGCAGATTTGAGCAGGCAGAAAAAAATCAGCAAGCTTGAAGATGGTACCTTGAGATT 

TTTCAGGCTAATGAAAAAAGAATGAAGGAAAATTAACAGCCTCAGAGACCCATG^ 

CATATGCATGATGAGAGTCCCAGAAGGAGAGGAGAGAAAGGGTCAGAAAGAATGGCCACAAGCTGATGAAAAACA 
GT AACC TACCCACTCAGGAAGCTCAGTGAACTC CAATG AGGATGAATATCAGAGATC CACACCTAGAT ATTTCAT 
AATCAAAGTGTCAAATGACAAAG AATC TTGAAAG CAGC AAG AGATG AGCAACTTATCTTGTTCAAAGGATCTTTG 
ATCAGATTAACAGCTCATTTCTCCTCAGAAATCATGGGAGCCAGGAGATAGTGGGATGAACACTGTTGAAGGCAA 
AACCTTCAACTGTAATTATTGGACTTTTGAGTCTTAGATGGTCCTGACC^ 

ATTTAATCCCTAATAACAATTAGTCAAGCTTCCTTGACCTGTAGGAAGGCCTGTCTTTAGGCCGGGCACAGTGGC 
TTACACCTGTAATCCCAGCACTTTGGGAGGCCCAGACGGGTGGATCATTTGGGGTCAGGCTGATCTCAAACTCCT 
GAGTTCAGGTGATCTGCCCGCCTCAGCCTCCCAAAGTGTTGTGATTGCAGGCGTGAGCCACTGCGCCTGGCCGGA 
ATTTCTTTTTAAGGCTGAATGATGGGGGCCAGGCACGATGGCT 

AACATGCACCACCATGCCTGGCTAATTTTTGTATTTTTAGTAGAGACGTGTTAGCCAGGCTGGTCTCGATCTCCT 
GACCTCAAGTGACCACCTGCCTCAGCCTCCCAAAGTACTGGGATTACAGGCGTGAGCCACTGTGCCTGGCCTTGA 
GCATCTTGTGATGTGCTTATTGGCCATTTGTATATCCT 

CCTTTTTAAATTTTTATTATTTATTTATTTATTTATTTTGAGACAGGGTCTTGTTCTGTTGCCCAGGCTGGAG 

CAGTGGCACAGTCTTGGCTCACTGCAGCCTCGACCTCCTGGGCT 

AGCTGTATTTTTTTGTATTTTGTATTTTGTAGCTGTAGTCT 

TGCCCAGGCTGGTCTTGAACTCCTGAGCTCAAGTGATCTGCC^ 

CATGAGCCACTGCACCTGGCAAACTCCCAAAATTCAACACAC^ 

GAGGGGCCGGGTGTGGCCCCAACTACCAGGGAGACTGAAGTGGGAGGATCGCTTGGGCATGAGAAGTCGAGGCTG 
CAGTGAGTCGAGGTTGTGCGACTGCATTCCAGCCTGGACAACAGAGTGAGACCCTGTCTC 
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FTOTTRE 166 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA71213 
xsubunit 1 of 1, 368 aa, 1 stop 
><MW: 42550, pi: 9.11, NX{S/T): 1 

MGLLAFLKTQ F VLHLLVG F VFWS GL V I NF VQLCTLALWP VS KQLYRRLNCRLAYS LWSQLV 
MLLEWWSCTECTLFTDQATVERFGKEHAVIILNHNFEIDFLCGWTMCERFGVLGSSKVLAKK 
ELLYVPLIGWTWYFLEIVFCKRKWEEDRDTWEGLRRLSDYPEYMWFLLYCEGTRFTETKHR 
VSMEVAAAKGLPVLKYHLLPRTKGFTTAVKCLRGTVAAVYDVTLNFRGNKNPSLLGILYGKK 
YEADMCVRRFPLEDIPLDEKEAAQWLHKLYQEKDALQEIYNQKGMFPGEQFKPARRPWTLLN 
FLSWATILLSPLFSFVLGVFASGSPLLILTFLGFVGAASFGVRRLIGESLEPGRWRLQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

Transmembrane domains : 

amino acids 307-323, 335-352 

Tyrosine kinase phosphorylation sites. 

amino acids 160-168, 161-169 
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FIGURE 167 

GATATTCTTTATTTTTAAGAATCTGAAGTACTATGCATCACTCCCTCCAATGTCCTGGGGCA 
GCCACCAGGCATATTCATCTTTGTGTGTGTTTTTCTTTTGCTTTAGCACTGGGGCACTTCTT 
GCTTATTTCTTTGGTAGGAAAGGGGCTCAGTTTGTCTTGTGGGGTTGGTGGCAGGCAGGCCG 
GCTTACGCCTGATACGGCCCTGGGTTAGAAGGGAAGGGAAGATAAACTTTTATACAAATGGG 
GATAGCTGGGGTCTGAGACCTGCTTCCTCAGTAAAATTCCTGGGATCTGCCTATACCTTCTT 
TTCTCTAACCTGGCATACCCTGCTTAAAGCCTCTCAGGGCTTCTCTCTGTTCTTAGGATCAA 
AGTATTTAGAGCTACAAGAGCCCTCATGGTCTGGCCCCTGCCCCCCTGGCCAGCTTCATTGT 
ACATGTGGTGTTCTCTTGTCGTTCCTGTAATGTGGTATGCCATGGGGTCTTTGCACAAGCCT 
TTCCTCTTTGGCTGGACACTGTTCCCTGCCCCCCCCATACTCTTCCTACTTAATATGTAGTC 
ATCCTGCAGATTTCAATTCTAACATCATTTTCTCCAGGGATCCTGGCCTGACAGAATCTCAT 
CTTGTTTAATGCTCTCATAAGACCACTTGTTTCCCTTTTGCAGCACTTGCCACTCAGTTGTA 
TCTTTATGTGCGTTTGTGGTTGTATGGGTTGTGTCTGTTCCCCAGAATGCCCAGCTCTGAGC 
TGCGTGAGGGTCAAGGGCATTGCTGTGCCTGCCAGGTATAGTGCCTACATGTGGTGGGTGCT 
CATGTTTTAGAGACTAAATGGAGGAGGAGATGAGGAAAAGATTGAAATCTCTCAGTTCACCA 
GATGGTGTAGGGCCCAGCATTGTAAATTCACACGTTGACTGTGCTTGTGAATTATCTGGGGA 
TG CAGGTCCTGATT CAGTAGGCCCAGGTTGGGCATCTCTAACAAACTC C C ACGTGATGCTGA 
TGCTGGTCCTATGAACTATACTAAATAGTAAGAATCTATGGAGCCAGGCTGGGCATGGTGGC 
TCACACCTATGATCCCAGCACTTTGGGAGGCTGAGGCAGGCTGATCACCTGGAGTCAGGATT 
TCAAGACTAGCCTGGCCAACATGGTGGAACCCCATCTGTACTAAAAATACACAAATTAGCTG 
GG CATGGTGGCACATG CCTGTAGTCC CAGCTACTTGGGAGG CTGAAGCAAGAGAATCGCTTG 
AACCTGGGAGGCGGAGGTTGCAGTGAGCCGAGATCAGGCCACTGTATTCCAACCAGGGTGAC 
AGAGTGAGACTCTATGTCCAAAAAAAAAAAAA 



Uf/2.7d 



WO 00/12708 



PCT/US99/20UI 



FTGTJRE 168 

>< /usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA71234 
xsubunit 1 of 1, 143 aa, 1 stop 
><MW: 15624, pi: 9.58, NX(S/T): 0 

MHH S LQ C PGAAT RH IHLC VC F S F ALALGHF LL I S LVGKGL S L S CG VGGRQ AGLRL I R P WVRR 
EGKINFYTNGDSWGLRPASSVKFLGSAYTFFSLTWHTLLKASQGFSLFLGSKYLELQEPSWS 
GPCPPGQLHCTCGVLLSFL 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 



WO 00/12708 



PCT/US99/20111 



FIGURE 169 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGA 
TTATAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCAC 
AGGAGTTGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTAC 
ATTAAAATCTGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGT 
CACTGTGGCTCTGGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCAC 
ACCGTCCCCTCGAAGCCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTC 
ACCAACTGTCTCACGTCTGGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTA 
GCTGCGGCTTTCAAGGTGGGCCTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTT 
CATAGGCGATGGCTCCCACTGCCCAGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGG 
CCAGGACGGGCCGTGGACACCTGCTCAGAAGCAGTGGGTGAGACATCACGCTGCCCGCCCAT 
CTAACCTTTTCATGTCCTGCACATCACCTGATCCATGGGCTAATCTGAACTCTGTCCCAAGG 
AACCCAGAGCTTGAGTGAGCTGTGGCTCAGACCCAGAAGGGGTCTGCTTAGACCACCTGGTT 
TATGTGACAGGACTTGCATTCTCCTGGAACATGAGGGAACGCCGGAGGAAAGCAAAGTGGCA 
GGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATGGAGGTGTTGGGTTATCACAAGGC 
ATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTGCCGATGGCGCAT6ACACACT 
CGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGATCCACGTACCAGCTGCTG 
AAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAATCTGCGATCACCAG 
CCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCCCTCCTTCCCTC 
TGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGGGGCTAATG 
GCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCTGTGCG 
AACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGGCT 
CAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGT 
TCAAATGATCTCCAAGGG C C CTTATACCCCAGGAGACTTTGATTTGAATTTGAAACCCCAAA 
TCCAAACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATG 
CCAACATTTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTG 
GCCAACATGGTGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGT 
GTGTGCCTGTATCCCAGCTACTCGGGAGGCTGAGACAGGAGAATTACTTGAACCTGGGAGGT 
GAAGGAGGCTGAGACAGGAGAATCACTTCAGCCTGAGCAACACAGCGAGACTCTGTCTCAGA 
AAAAATAAAAAAAGAATTATGGTTATTTGTAA 



/7//270 



WO 00/12708 



PCTAJS99/20111 



FTOTTRR 170 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA71277 
xsubunit 1 of 1, 109 aa, 1 stop 
><MW: 11822, pi: 8.63, NX(S/T): 0 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLVJPKPDLHSGTRTEVSTHTVPSKPGTA 
SPCWPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 



WO 00/12708 



PCT/US99/20111 



FIGURE 171 

GCGGGCCCGCGAGTCCGAGACCTGTCCCAGGAGCTCCAGCTCACGTGACCTGTCACTGCCTC 
CCGCCGCCTCCTGCCCGCGCCATGACCCAGCCGGTGCCCCGGCTCTCCGTGCCCGCCGCGCT 
GGCCCTGGGCTCAGCCGCACTGGGCGCCGCCTTCGCCACTGGCCTCTTCCTGGGGAGGCGGT 
GCCCCCCATGGCGAGGCCGGCGAGAGCAGTGCCTGCTTCCCCCCGAGGACAGCCGCCTGTGG 
CAGTATCTTCTGAGCCGCTCCATGCGGGAGCACCCGGCGCTGCGAAGCCTGAGGCTGCTGAC 
CCTGGAGCAGCCGCAGGGGGATTCTATGATGACCTGCGAGCAGGCCCAGCTCTTGGCCAACC 
TGGCGCGGCTCATCCAGGCCAAGAAGGCGCTGGACCTGGGCACCTTCACGGGCTACTCCGCC 
CTGGCCCTGGCCCTGGCGCTGCCCGCGGACGGGCGCGTGGTGACCTGCGAGGTGGACGCGCA 
GCCCCCGGAGCTGGGACGGCCCCTGTGGAGGCAGGCCGAGGCGGAGCACAAGATCGACCTCC 
GGCTGAAGCCCGCCTTGGAGACCCTGGACGAGCTGCTGGCGGCGGGCGAGGCCGGCACCTTC 
GACGTGGCCGTGGTGGATGCGGACAAGGAGAACTGCTCCGCCTACTACGAGCGCTGCCTGCA 
GCTGCTGCGACCCGGAGGCATCCTCGCCGTCCTCAGAGTCCTGTGGCGCGGGAAGGTGCTGC 
AACCTCCGAAAGGGGACGTGGCGGCCGAGTGTGTGCGAAACCTAAACGAACGCATCCGGCGG 
GACGTCAGGGTCTACATCAGCCTCCTGCCCCTGGGCGATGGACTCACCTTGGCCTTCAAGAT 
CTAGGGCTGGCCCCTAGTGAGTGGGCTCGAGGGAGGGTTGCCTGGGAACCCCAGGAATTGAC 
CCTGAGTTTTAAATTCGAAAATAAAGTGGGGCTGGGACACAAAAAAAAAAAAAAAAAAA 



WO 00/12708 PCT/US99/201 1 1 

FTGTTRE 172 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA71282 
xsubunit 1 of 1, 262 aa, 1 stop 
><MW: 28809, pi: 8.80, NX(S/T): 1 

MTQPVPRLSVPAALALGSAALGAAFATGLFLGRRCPPWRGRREQCLLPPEDSRLWQYLLSRS 
MREHPALRSLRLLTLEQPQGDSMMTCEQAQLLANLARLIQAK 

PADGRWTCEVDAQPPELGRPLWRQAEAEHKIDLRLKPALETLDELLAAGEAGTFDVAWDA 
DKENCSAYYERCLQLLRPGGILAVLRVLWRGKVLQPPKGDVAAECVRNLNERIRRDVRVYIS 
LLPLGDGLTLAFKI 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 5 < 

Transmembrane domains : 

amino acids 8-30, 109-130 

N-glycosylation site, 
amino acids 190-193 

Tyrosine kinase phosphorylation site. 

amino acids 238-246 

N-myristoylation sites. 

amino acids 22-27, 28-33, 110-115, 205-210, 255-260 

Ami da t ion sites. 

amino acids 31-34, 39-42 



WO 00/12708 



PCT/US99/20111 



FIGURE 173 

CCGCCGCCGCAGCCGCTACCGCCGCTGCAGCCGCTTTCCGCGGCCTGGGCCTCTCGCCGTCA 
GCATOCCACACGCCTTCAAGCCCGGGGACTTGGTGTTCGCTAAGATGAAGGGCTACCCTCAC 
TGGCCTGCCAGGATCGACGACATCGCGGATGGCGCCGTGAAGCCCCCACCCAACAAGTACCC 
CATCTTTTTCTTTGGCACACACGAAACAGCCTTCCTGGGACCCAAGGACCTGTTCCCCTACG 
ACAAATGTAAAGACAAGTACGGGAAGCCCAACAAGAGGAAAGGCTTCAATGAAGGGCTGTGG 
GAGATCCAGAACAACCCCCACGCCAGCTACAGCGCCCCTCCGCCAGTGAGCTCCTCCGACAG 
CGAGGCCCCCGAGGCCAACCCCGCCGACGGCAGTGACGCTGACGAGGACGATGAGGACCGGG 
GGGTCATGGCCGTCACAGCGGTAACCGCCACAGCTGCCAGCGACAGGATGGAGAGCGACTCA 
GACTCAGACAAGAGTAGCGACAACAGTGGCCTGAAGAGGAAGACGCCTGCGCTAAAGATGTC 
GGTCTCGAAACGAGCCCGAAAGGCCTCCAGCGACCTGGATCAGGCCAGCGTGTCCCCATCCG 
AAGAGG AGAACTCGGAAAGCTCAT CTG AGT CGGAGAAGACCAGCGACCAGGACTTCACACCT 
GAGAAGAAAGCAGCGGTCCGGGCGCCACGGAGGGGCCCTCTGGGGGGACGGAAAAAAAAGAA 
GGCGCCGTCAGCCTCCGACTCCGACTCCAAGGCCGATTCGGACGGGGCCAAGCCTGAGCCGG 
TGGCCATGGCGCGGTCGGCGTCCTCCTCCTCCTCTTCCTCCTCCTCCTCCGACTCCGATGTG 
TCTGTGAAGAAGCCTCCGAGGGGCAGGAAGCCAGCGGAGAAGCCTCTCCCGAAGCCGCGAGG 
GCGGAAACCGAAGCCTGAACGGCCTCCGTCCAGCTCCAGCAGTGACAGTGACAGCGACGAGG 
TGGACCGCATCAGTGAGTGGAAGCGGCGGGACGAGGCGCGGAGGCGCGAGCTGGAGGCCCGG 
CGGCGGCGAGAGCAGGAGGAGGAGCTGCGGCGCCTGCGGGAGCAGGAGAAGGAGGAGAAGGA 
GCGGAGGCGCGAGCGGGCCGACCGCGGGGAGGCTGAGCGGGGCAGCGGCGGCAGCAGCGGGG 
ACGAGCTCAGGGAGGACGATGAGCCCGTCAAGAAGCGGGGACGCAAGGGCCGGGGCCGGGGT 
CCCCCGTCCTCCTCTGACTCCGAGCCCGAGGCCGAGCTGGAGAGAGAGGCCAAGAAATCAGC 
GAAGAAGCCGCAGTCCTCAAGCACAGAGCCCGCCAGGAAACCTGGCCAGAAGGAGAAGAGAG 
TGCGGCCCGAGGAGAAGCAACAAGCCAAGCCCGTGAAGGTGGAGCGGACCCGGAAGCGGTCC 
GAGGGCTTCTCGATGGACAGGAAGGTAGAGAAGAAGAAAGAGCCCTCCGTGGAGGAGAAGCT 
GCAGAAGCTGCACAGTGAGATCAAGTTTGCCCTAAAGGTCGACAGCCCGGACGTGAAGAGGT 
GCCTGAATGCCCTAGAGGAGCTGGGAACCCTGCAGGTGACCTCTCAGATCCTCCAGAAGAAC 
ACAGACGTGGTGGCCACCTTGAAGAAGATTCGCCGTTACAAAGCGAACAAGGACGTAATGGA 
GAAGGCAGCAGAAGTCTATACCCGGCTCAAGTCGCGGGTCCTCGGCCCAAAGATCGAGGCGG 
TGCAGAAAGTGAACAAGGCTGGGATGGAGAAGGAGAAGGCCGAGGAGAAGCTGGCCGGGGAG 
GAGCTGGCCGGGGAGGAGGC CCC CCAGGAGAAGGCGGAGGACAAG C CCAGCACCGATCTCTC 
AGCCCCAGTGAATGGCGAGGCCACATCACAGAAGGGGGAGAGCGCAGAGGACAAGGAGCACG 
AGGAGGGTCGGGACTCGGAGGAGGGGCCAAGGTGTGGCTCCTCTGAAGACCTGCACGACAGC 
GTACGGGAGGGTCCCGACCTGGACAGGCCTGGGAGCGACCGGCAGGAGCGCGAGAGGGCACG 
GGGGGACTCGGAGGCCCTGGACGAGGAGAGCTGAGCCGCGGGCAGCCAGGCCGAGCCCCCGC 
CCGAGCTCAGGCTGCCCCTCTCCTTCCCCGGCTCGCAGGAGAGCAGAGCAGAGAACTGTGGG 
GAACGCTGTGCTGTTTGTATTTGTTCCCTTGGGTTTTTTTTTCCTGCCTAATTTCTGTGATT 
TCCAACCAACATGAAATGACTATAAACGGTTTTTTAATGA 



WO 00/12708 



PCT/US99/201I1 



FTGIJRE 174 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA71286 
x'subunit 1 of 1, 671 aa, 1 stop 
><MW: 74317, pi: 7.61, NX(S/T): 0 

MPHAFKPGDLVFAKMKGYPHWPARIDDIADGAVKPPPNKYPIFFFGTHETAFLGPKDLFPYD 
KCKDKYGKPNKRKGFNEGLWEIQNNPHASYSAPPPVSSSDSEAPEANPADGSDADEDDEDRG 
VMAVTAVTATAASDRMESDSDSDKSSDNSGLKRKTPALKMSVSKRARKASSDLDQASVSPSE 
EENSESSSESEKTSDQDFTPEKKAAVRAPRRGPLGGRKKKKAPSASDSDSKADSDGAKPEPV 
AMARSASSSSSSSSSSDSDVSVKKPPRGRKPAEKPLPKPRGRKPKPERPPSSSSSDSDSDEV 
DRISEWKRRDEARRRELEARRRREQEEELRRLREQEKEEKERRRERADRGEAERGSGGSSGD 
ELREDDEPVKKRGRKGRGRGPPSSSDSEPEAELEREAKKSAKKPQSSSTEPARKfGQKEKRV 
RPEEKQQAKPVKVERTRKRSEGFSMDRKVEKKKEPSVEEKLQKLHSEIKFALKVDSPDVKRC 
LNALEELGTLQVTSQILQKNTDWATLKKIRRYKANKDVMEKAAEVYTRLKSRVLGPKIEAV 
QKVNKAGMEKEKAEEKLAGEELAGEEAPQEKAEDKPSTDLSAPVNGEATSQKGESAEDKEHE 
EGRDSEEGPRCGSSEDLHDSVREGPDLDRPGSDRQERERARGDSEALDEES 

Signal peptide: 

amino acids 1-13 



WO 00/12708 



PCT/US99/20U1 



FIGURE 175 

G TTGG TTCTC CTGG ATCTTC ACCTT ACCAACTG CAG ATCTTGGG ACTCAT CAG C CTCAAT AATT AT ATT AAATT A 
ACACCATTTGAAAG AGAACATTG T TT T C AT CATG AATG CT AAT AAAG ATG AAAGACTT AAAG CCAG AAGC CAAGA 
TTTTCAC CTTTTTC CTG CTTTGATG ATG CTAAG CATG AC CATGTT GTT T CTT C CAG TCAC TG G CACTTTG AAGCA 
AAATATTC CAAGA CTCAAG C T AAC C T AC AAAG ACTTG CTG CTTTC AAAT AG CTGT ATTC C CTTTTTGGG TTCAT C 
AGAAGGACrrcGATTTTCAAACTCTTCTCTTAGATGAGGAAAGAGGCAG^CTGCTCTTGGGAG 
CTTTCTACTCAGTCTGGTTGACTTAAACAAAAATTTTAAGAAGATTTATTGGCCTGC 
ATTATGTAAATTAGCTGGGAAAGATGCCAATACAGAATGT<3Cy\AATT^ 
AACTCACATATATGTGTGTGGAACTGGAGCATTTCATCCAATATGTGGGT^ 

GGATATTATATTCAAACTAGACACACATAATTTGK3AGTCTGGCAGACTGAAATGTCCTTTCGATCCTCAGCAGCC 

TTTTGCTTCAGTAATGACAGATGAGTACCTCTACTCTGGAACAGCTTCTGATTTCCTTGGCA 

C ACTCGATCCCTTGGGCCTACTCATGACCAC CACTAC AT CAGAACTG ACATTT CAGAG CACT ACTGGCTCAATGG 

AGCAAAATTTATTGGAACTTTCTTCATACCAGACACCTAC^^ 

TGAATCATCTCAAGAAGGCIAGTACCTCCGATAAAACCATCCTTTCTCGAGTTGGAAGAGTTTGTAAGAATGATGT 
AGGAGGACAACGCAGCCTGATAAACAAGTGGACGACTTTTCTTAAGGCCAGACTGATTTGCTCAATTCCTGGAAG 
TGATGGGGCAGATACTTACTTTGATGAGCTTCAAGATATTTATTTACTCCCCACAAGAGATGAAAGAAATCCTGT 
AGT AT ATGG AGTC TTTACT ACAAC CAGCTCC ATCTTCAAAGGCTCTG CTGTTTGTGTGTATAGCATGGCTG ACAT 
CAGAGCAGTTTTTAATGGTCCATATGCrCATAAGGAAAGTGCAGACC^ 

TCCTTATCCACGGCCTGGTACATGTCCAAGCAAAACCTATGACCCACTGATTAAGTCCACCCGAGATTTTCCAGA 

TGATGTCATCAGTTTCATAAAGCGGCACTCTGTGATGTATAAGTCCGTATACCCAGTTGCAGGAGGACCAACGTT 

CAAGAGAATCAATGTGGATTACAGACTGACACAGATAGTGGTGGATCATGTCATTGCAGAAGATGGCCAGTACGA 

TGTAATGTTTCTTGGAACAGACATTGGAACTGTCCTCAAAGTTGTCAGCATTTCAAAGGAA 

AGAGGTAGTGCTGGAGGAGTTGCAGATATTCAAGCACTCATC^TCATCTTGAACATGGAATTGTCTCTGAAGCA 

GCAACAATTGTACATTGGTTCCCGAGATGGATTAGTTCAGCTCTCCTTGCACAGATGCGACACTTATGGGAAAGC 

TTGCGCAGACTGTTGTCTTGCCAGAGACCCCTACTGTGCCTGGGATGGAAATGCATGCTCTCGATATGCTCCTAC 

TTC^AAAAGGAGAGCTAGACGCCAAGATGTAAAATATGGCGACCCAATCACCCAGTGCTGGGACATCGAAGACAG 

CATTAGTCATGAAACTGCTGATGAAAAGGTGATTTTTGGCATTGAATTTAACTCAACCTTTCTGGAATGTATACC 

TAAATCCCAACAAGCAACTATTAAATGGTATATCCAGAGGTCAGGGGATGAGCATCGAGAGGAGTTGAAGCCCGA 

TGAAAGAATCATCAAAACGGAATATGGGCTACTGATTCGAAGTTTGCAGAAGAAGGATTCTGGGATGTATTACTG 

CAAAGCCCAGGAGCACACTTTCATCCACACCATAGTGAAGCTGACTTTGAATGTCATTGAGAATGAACAGATGGA 

AAATACCCAGAGGGC^GAGCATGAGGAGGGGCAGGTCAAGGATCTATTGGCTGAGTCACGGTTGAGATACAAAGA 

CTACATCCAAATCCTTAGCAGCCCAAACTTCAGCCTCGACCAGTACTGCGAACAGATGTGGCACAG 

GAGACAGAGAAACAAGGGGGGCCCAAAGTGGAAGCACATGCAGGAAATGAAGAAGAAACGAAATCGAAGACATCA 

CAGAGACCTGGATG AG CTCCCTAGAGCTG TAGCC ACGTAGTTTTCTACTCAATTTAAAG AAAAGAATTCCTTACC 

TATAAAAACATTGCCTTCTGTTTTGTATATCCCTTATAGTAATTCATAAATGCTTCCCATGGAGTTTTGCTAAGG 

CACAAGACAATAATCTGAATAAGACAATATGTGATGAATATAAGAAAGGGCAAAA^ 

CCAAGAAGAAATCTTGCACAAGCAAAGTATAAGAATTATCCTAAAAA 

TGTTTTGAGTTTTGGAATTTATTG TCATGTAAATAGTTGAGCT AAGCAAGCC CCGAATTTGATAGTGTATAAGGT 

GCTCTATTCCCTCGAATGTCCATTAAGCATGGAATTTACCA^ 

CATTCCTATTGAGAACCAGCTACCTTGTGGTAGGGAATAAGAGGTCAGACACAA^ 

AACAGGAACTTTCTCAGTGAGCCATTCACTCCTGGAGAATGGTATAGGAATTTGGAGAGGTGCATTATTTCTTTC 
TGGCC ACTGGGGTTAAATTT AGTG TACT ACAACATTGATTTACTGAAGGG CACT AATG TTTC C C CC AGG ATT TC T 
ATTGACTAGTCAGGAGTAACAGGTTCACAGAGAGAAGTTGGTGCTTAGTTAT^ 
v GCTCTACAGGGACAGAATGCTTAATAAATACTTTAATAAGATATGGGAAAATATTTTAATAAAAC^GGAAAAC^ 
TAATGATGTATAATGCATCCTG ATGGGAAGGC ATG CAG ATGGGATTTGTTAG AAGACAGAAGGAAAGAC AG CCAT 
AAATTCTGG CTTTGGGGAAAAC TC AT ATC C C C ATG AAAAGG AAG AACAAT CACAAAT AAAGTG AGAGTAATGT AA 
TGGAGCTCTTTTCACTAGGGTATAAGTAGCTGCCAATTTGTAATT^ 
AACTGCTAGCAAAATCTCAGGAAACATAAATTCTTCTGAAGAATCATAGG 

AATGATATTTCAGTATATATTTT CT CTCTTTTAAAAAATATT^ATCATACTCTGTATATTATTTCTTTOTACTGC 
CTTTATTCTCTCCTGTATATTGGATTTTGTGATTATATTTGA 

GAATTAAGAAAATGACATTTCTGGGGAGTGGGGATATATATTTGTTGAATAACAGAACGAGTGTAAAATT^ 
AACGGAAAGGGTTAAATTAACTCTTTGAC ATCTTCACTCAAC CTTTTCTCATTG CTGAGTT AATCTGTTGT AATT 
GTAGTATTGTTTTTGTAATTTAACAATAAATAAGCCTGCTACATGT 



WO 00/12708 



PCT/US99/20U1 



FIGURE 176 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA71883 
xsubunit l of ly 777 aa, l stop 
><MW: 89651, pi: 7.97, NX(S/T): 3 

MNANKDERLKARSQDFHLFPALMMLSMTMLFLPVTGTLKQNIPRLKLTYKDLLLSNSCIPFL 
GSSEGLDFQTLLLDEERGRLLLGAKDHIFLLSLVDLNKNFKKIYWPAAKERVELCKLAGKDA 
NTECANFIRVLQPYNKTHIYVCGTGAFHPICGYIDLGVYKEDIIFKLDTHNLESGRLKCPFD 
PQQ PFAS VMTDE YL YSGTAS DF LGKDTAFTRS LGPTHDHHYI RTD I SEHYWLNG AKF I GT FF 
IPDTYNPDDDKIYFFFRESSQEGSTSDKTILSRVGRVCKNDVGGQRSLINKWTTFLKARLIC 
S I PGSDGADTYFDELQDI YLLPTRDERKPWYGVFTTTSS I FKGSAVCVYSMAD IRAVFNGP 
YAHKESADHRWVQYDGRIPYPRPGTCPSKTYDPLIKSTRDFPDDVISFIKRHSVMYKSVYPV 
AGGPTFKRINVDYRLTQIVVDHVIAEDGQYDVMFLGTDIGTVLKWSISKEKWNMEEVVLEE 
LQ I FKHSS 1 1 LNMELSLKQQQL YI GSRDGL VQLS LHRCDT YGKACADCCLARDP YCAWDGNA 
CSRYAPTSKRRARRQDVKYGDPITQCWDIEDSISHETADEKVIFGIEFNSTFLECIPKSQQA 
TIKWYIQRSGDEHREELKPDERIIKTEYGLLIRSLQKKDSGMYYCKAQEHTFIHTIVKLTLN 
VIENEQMENTQRAEHEEGQVKDLLAESRLRYKDYIQILSSPNFSLDQYCEQMWHREKRRQRN 
KGGPKWKHMQEMKKKRNRRHHRDLDELPRAVAT 

Important features of the protein: 
Signal peptide: 

amino acids 1-3 6 

N-glycosylation sites. 

amino acids 139-142, 607-610, 724-727 

Tyrosine kinase phosphorylation site. 

amino acids 571-576 

Gram-positive cocci surface proteins 'anchoring 1 hexapeptid . 

amino acids 3 2-37 



WO 00/12708 



PCT/US99/20I11 



FTGTJRE 177 

CCCTGACCTCCCTGAGCCACACTGAGCTGGAAGCCGCAGAGGTCATCCTGGAGCATGCCCACCGCGGGGAGCAGA 

CAACCTCCCAGGTAAGCTGGGAGCAAGACCTGAAGCTGTTTCTTCAGGAGCCTGGTGTAriTT 

C TC AG C AGTTT C AG CCAGCAGGG AC TG ATC AGGTGTGTGTC CTG G AG TG GGGAG CAGAAGGCGTGG CTGG CAAGA 

GTGGCCTGGAGAAAGAGGTTC^GCGCTTGACC^GCCGAGCTGCCCGTGACTAC^ 

GGGTGAGGTGGGGGGGCACAGGTGTCATGTGCACCTTCTTC 

AGCCATTGAGGGTGTCATGGAGCTACAGAGGGGAG^GAAAGGTATTTTAAGGTAACAGTGTC 

GAGCACAGTTTTTGGAGCTAGACCGACATAGGTTCAAATTCTCTTCTGTTGCTTCCTA 

AAGGGAGTGACTTAACCTCTCTGGACTTCAATTTCCTCATGACT^ 

AGGGAAGATTAAATGACATAATGTATGTG ATG CAACTAGCAAAGTACCAGTCCCATAGTAAGTCATGCCCCACAG 
TATTTCCACCCACCCCTGTTCTCTGCCTTCCOU^CCAGGTACTGCAACGACTGGAGCAGAGGCGGCAGCA 
CAGAGCGGGAGGCTCCAAGCATAGAACAGAGGTTACAGGAAGTGCGAGAGAGCATCCGCCGGGCACAGGTGAGCC 
AGGTGAAGGGGGCTGCCCGGCTGGCCCTGCTGCAGGGGGCTGGCTTAGATGTGGAGCGCTGGCTGAAGCCAGCCA 
TGACCCAGGCCCAGGATGAGGTGGAGCAGGAGCGGCGGCTCAGTGAGGCTCGGCTGTCCCAGAGGGACCTCTCTC 
CAACCGCTGAGGATGCTGAGCTTTCTGACTTTGAGGAATGTGAGGAGACGGGAGAGCTCTTTGAGGAGCCTGCCC 
CCC^GCCCTGGCCACGAGGGCCCTCCCCTGCCCTGCACACGTGGTATTTCGCTATCAGGCAGGGCGTGAGGATG 
AGCTGACAATCACGGAGGGTGAGTGGCTGGAGGTCATAGAGGAGGGAGATGCTGACGAATGGGTCAAGGCTCGGA 
ACCAGCACGGCGAGGTAGGCTTTGTCCCTGAGCGATATCTCAACTTCCCGGACCTCTCCCTCCCAGAGAGCAGCC 
AAGACAGTGACAATCCCTGCGGGGCAGAGCCCACAGCATTCCTGGCACAGGCCCTGTACAGCTACACCGGACAGA 
. GTGCAGAGGAGCTG AG CTTCCCTG AGGGGGC ACTCATCCGTCTGCTG CCCCGGGC CCAAG ATGGAGTAGATG ACG 
GCTrCTGGAGGGGAGAATTTGGGGGCCGTGTTGGGGTCTTCCCCTCCCTGCTGGTGGAAGAGCTGCTTGGCCCCC 
CAGGGCCACCTGAACTCTCTGACCCTGAACAGATGCTGCCGTCCCCTTCTCCTCCCAGCTTCTCCCCACCTGCAC 
CTACCTCTGTGTTGGATGGGCCCCCTGCACCTGTCCTGCCTGGGGAOiJ^GCCCrrGGACTTCCCTGGGTTCCTGG 
ACATGATGGCACCTCGACTCAGGCCGATGCGTCCACCACCTCCCCCGCCGGCTAAAGCCCCGGATCCTGGCCACC 
CAGATCCCCTCACCTGAAGGCCAGGGAAGCCTTGACCCCCAGTGATGCTGCTGTCCCTATCTTCAAGCTGTCAGA 
CCACACCATC^TGATCCAGAGCAACACAGCC^AAAGCTGGAATCGCCCTTATTTCCACCCTCACCTCCAAGGGT 
GGAAACTTGCCCCTTCCCATTTCTAGAGCTGGAACCCACTCCrrTTTTTCCCATTGTTCTATCATCTCTAGGACC 
GGAACTACTACCTTCTCTTCTGTCATGACCCTATCTAGGGTGGTGAAATGCCTGAAATCTCTGGGGCTGGAAACC 
ATCCATCAAGGTCTCTAGTAGTTCTGGCCCACCTCTTTCCCCACCCTGGCTCCATGACCCACCCCACTCTGGATG 
CCAGGGTCACTGGGGTTGGGCTGGGGAGAGGAACAGGCCTTGGGAATCAGGAGCTGGAGCCAGGATGCGAAGCAG 
CTGTAATGGTCTGAGCGGATTTATTGACAATGAATAAAGGGCACGAAGGCCAGGCCAGGGCCTGGGCCT 
CTAAGAGGGCAGGGGGCCTACGGTG CTATTGCTTTAGGGGCCCACCACGGGCAGGGG CCTGCTC CCAGCTGCCAC 
GCTCTATCATATGGAGCGAGGTGTTGGGGAAGGCGGGGCAGGCAGCCTGTTGCAGGCAGGGGAAGGAGAAGAGAC 
TGAGGGGCTGTGACCTCTCCTGAGGCCCCCAGCCTGAGACTGTGCAACTCCAGGTGGAAGTAGAGCTGGTCCCTC 
AGCTGGGGGGCAGTGCTGTCCAGTGGAGGGGAGGGCTTTCACGCCCACCCACCCCCTGGCCCTGCCAGCTGGTAG 
TC CATC AG CAC AATG AAGGAGACTTGGAG AAG AGG AAG AAT AAC ACTGTTG CTT CC TGTT CAAG C TG TGTCC AG C 
TTTTCCCCTGGGGCTCCAGGACCTTCCCnACCTCCACCACCAA 

AGTTTACTCTGGGGGTTCAGGGAGCCGAAAGGCTTAAATAGTTTAAGTAGGTGATGGGAAGATGAGATTACCTC^ 
TTTAGGGCTCAGGCAGACTCACCTCACATACTCCCTC^ 

TCAACAATG AGAGACC AGGAGTAGGTCCTATCAGTGCCC CCCAGAGTAGAGAG CAATAAGAGCCCAGCCCAGTGC 

AGTCCCGGCTGTGTTTTCCTACCTGGTGATCAGAAGTGTCTGGTTTGCTTGGCTGCCCATTTGCCTCTT 

GCAGCCCTGGGCTTGGGCCCCTCCCTCCGGCCCTCAGTGTTGGCT^ 

C ACG AGGGGTTAGGCTG CTGTCC CT G AGTCCTCCATTCTGTACTGGGGX3GCTCGCTAGG ACCTGGGGCTGTGGCC 
TCTCAGGGGGCAGCCTCTCCATGGCAGGCATCCCTGCCTTGGGCTGCCCTCCCCCAGACCCCTGACCACCCCCTG 
GGTCCTGTCCCCCACCAGAGCCCCAGCTCCTGTCTGTGGGGGAGCCATCACGGTGTTCGTGCAGTCCATAGCGCT 
TCTCAATGTGTGTCACCCGGAACCTGGGAGGGGAGGGAACACTGGGGTT^^ 

GCCCTCCCCTCTGACCAGGGACATCCTGAGTTTGGTGGCTACTTCCCTCTGGCCTAAGGTAGGGGAGGCCTTCTC 
AGATTGTGGGGCACATTGTGTAGCCTGACTTCTGCTGGAGCTCCCAGTCCAG<^GGAAAGAGCCAAGGCCCACTT 
TTGGGATCAGGTGCCTGATCACTGGGCCCCCTACCTCAGCCCCCCTTTCCCTGGAGC^CCTGCCCCACCTGCCCA 
CAGAGAACAC^GTGGTCTCCCCTGTCCGGGGGCGGCTTT 

GCCTCTTGCTGCGGCTGCAATGGATGCAAGGGGCTGCAGAGCCC^GGTGCACTGTGTGATGATGGGAGGGG 
CGTCCTGCAGGCTGGAGGTGGCATCCACACTGGACAGCAGGAGGAGGGGAGTGAGGGTAACATTTCCATTTCCCT 
TCATGTTTTGTTTCTTACGTTCTTTCAGCATGCTCCTTAAAACCCCAGAAGCCC 
TTT CTTGT CTTT AT CTAATAAACT C AAT ATT AAG 
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FTOTTRE 178 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA73401 
xsubunit 1 of 1, 370 aa, 1 stop 
><MW: 40685, pi: 4.53, NX(S/T): 0 

MQLAKYQSHSKSCPTVFPPTPVLCLPNQVLQRLEQRRQQASEREAPSIEQRLQEVRESIRRA 
QVSQVKGAARLALLQGAGLDVERWLKPAMTQAQDEVEQERRLSEARLSQRDLSPTAEDAELS 
DFEECEETGELFEEPAPQALATRALPCPAHWFRYQAGREDELTITEGEWLEVIEEGDADEW 
VKARNQHGEVGFVPERYLNFPDLSLPESSQDSDNPCGAEPTAFLAQALYSYTGQSAEELSFP 
EGALIRLLPRAQDGVDDGFWRGEFGGRVGVFPSLLVEELLGPPGPPELSDPEQMLPSPSPPS 
FSPPAPTSVLDGPPAPVLPGDKALDFPGFLDMMAPRLRPMRPPPPPPAKAPDPGHPDPLT 
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FIGURE 179A 

CACAGGGAGACCCACAGACACATATGCACGAGAGAGACAGAGGAGGAAA.GAGACAGAGACAAAGGCACAGCGG^ 
GAAGGCAGAGACAGGGCAGGCACAGAAGCGGCCCAGACAGAGTCCTACAGAGGGAGAGGCCAGAGAAGCTGCAGA 
AGACACAGGCAGGGAGAGACAAAGATCCAGGAAAGGAGGGCTCAGGAGGAGAGTTTGGAGAAGCCAGACCCCTGG 
GCACCTCTCCCAAGCCCAAGGACTAAGTTTT^ 

TCTGACCTTGGCAGGAGTCCAAGCCCCCAGGCTACAGAGAGGAGCTTTCCAAAGCTAGGGTGTGGAGGACTTGGT 

GCCCTAGACGGCCTCAGTCCCTCCCAGCTGCAGTACCAGTGCCATGTCCCAGACAGGCTCGCATCCCGGGAGGGG 

CTTX3GCAGGGCGCTGGCTGTGG<3GAGCCCAACCCT^ 

GCTGCTTCTCCTACTGCTXjGCCTCTCT^ 

CGTGTTTCCAGAGAAGCTCAACGGCAGCGTCCTGCCTGGCTCGGGCGCCCCTGCCAGGCTGTTGTGCCGCTTGCA 
GGCCTTTGGGGAGACGCrGCTACTAGAGCTGGAGCAGGACTCCGGTGTGCAGGTCGAGGGGCTGACAGTGCAGTA 
CCTGGGCCAGGCGCCTGAGCTGCTGGGTGGAGCAGAGCCTGGCACCTACCTGACTGGCACCATCAATGGAGATCC 
GGAGTCGGTGGCATCTCTGCACTGGGATGGGGGAGCCCTGTTAGGCGTGTTACAATATCGGGGGGCTGAACTCCA 
CCTCCAGCCCCTGGAGGGAGGCACCCCTAACTCTGCTGGGGGACCTGGGGCTCACATCCTACGCCGGAAGAGTCC 
TGCCAGCGGTC^GGTCCCATGTGCAACGTCAAGGCTCCTCTTGGAAGCCCCAGCCCCAGACCCCGAAGAGCCAA 
GCGCTTTGCTTCACTGAGTAGATTTGTGGAGACACTGGTGGTGGCAGATGACAAGATGGCCGCATTCCAOSGTGC 
GGGGCTAAAGCGCTACCTGCTAACAGTGATGGCAGCAGCAGCCAAGGCCTTCAAGCACCCAAGCATCCGCAATCC 
TGTCAGCTTGGTGGTGACTCGGCTAGTGATCCTGGGGTCAGGCGAGGAGGGGCCCCAAGTGGGGCCCAGTGCTGC 
CCAGACCCTGCGCAGCTTCTGTGCCTGGCAGCGGGGCCTCAACACCCCTGAGGACTCGGGCCCTGACCACTTTGA 
CACAGCCATTCTGTTTACCCGTCAGGACCTGTGTGGAGTCTCCACTTGCGACACGCTGGGTATGGCTGATGTGGG 
CACCGTCTGTGACCCGGCTCGGAGCTGTGCCATTGTGGAGGATGATGGGCTCCAGTCAGCCTTCACTGCTGCTCA 
TGAACTGGGTCATGTCTTCAACATGCTCCATGACAAC 

CTCTCGCCATGTCATGGCCCCTGTGATGGCTCATGTGGATCCTGAGGAGCCCTGGTCCCCCTGCAGTGCCCGCTT 
CATC^CTGACTTCCTGGACAATGGCTATGGGCACTGTCTCrTAGACAAACCAGAGGCTCCATTGCATCTGCCTGT 
GACTTTCCCTGGCAAGGACTATGATGCTGACCGCCAGTGCCAGCTGACCTTCGGGCCCGACTCACGCC^TTGTCC 
ACAGCTGCCGCCGCCCTGTGCTGCCCTCTGGTGCTCTGGCCACCTCAATGGCCATGCCATGTGCCAGACCAAACA 
CTCGCCCTGGGCCGATGGCACACCCTGCGGGCCCGCACAGGCCTGCATGGGTGGTCGCTGCCTCCACATGGACCA 
GCTCCAGGACTTCAATATTCCACAGGCTGGTGGCTGGGGTCCTTGGGGACCATGGGGTGACTGCTCTCGGACCTG 
TGGGGGTGGTGTCCAGTTCTCCTCCCGAGACTGCACGAGGCCTGTCCCCCGGAATGGTGGCAAGTACTGTGAGGG 
CCGCCGTACCCGCTTCCGCTCCTGCAACACTGAGGACTGCCCAACTGGCTCAGCCCTGACCTTCCGCGAGGAGCA 
GTGTGCTGCCTACAACCACCGCACCGACCTCTTCAAGAGCTTCCCAGGGCCCATGGACTGGGTTCCTCGCTACAC 
AGGCGTGGCCCCCCAGGACCAGTGCAAACTCACCTGCCAGGCCCGGGCACTGGGCTACTACTATGTGCTGGAGCC 
ACGGGTGGTAGATGGGACCCCCTGTTCCCCGGACAGCTCCTCGGTCTGTGTCCAGGGCCGATGCATCCMGCTGG 
CTGTGATCGCATCATTGGCTCCAAGAAGAAGTTTGA 

CAAGCAGTCAGGCTCCTTCAGGAAATTCAGGTACGGATACAACAATGTGGTCACTATCCCCGCGGGGGCCACCCA 
CATTCTTGTCCGGCAGCAGGGAAACCCTGGC CACCGGAGCATCTACTTGGCCCTGAAGCTG CCAG ATGGCTCCTA 
TGCCCTCAATGGTGAATACACGCTGATGCCCTCCCC 
CAGCGGGGCCACTCCAGCCTCAGAGACACrGTCAGGCCATG^ 

AGTGGCTGGCAACCCCCAGGACACACGCCTCCGATACAGCTTCTTCGTGCCCCGGCCGACCCCTTCAACGCCACG 
CCCC ACTCCCC AGGACTGGCTGCAC CGAAGAG CACAGATTCTGGAGATCCTTCGGCGGCGCCCCTGGGCGGGCAG 
GAA ATAA CCTCACTATCCCGGCTGCCCTTTCTGGGCACCGGGGCCTCGGACTTAGCTGGGAGAAAGAGAGAGC^ 
CTGTTGCTGCCTCATGCTAAGACTCAGTGGGGAGGGGCTGTGGGCGTGAGACCTGCCCCTCCTCTCTGCCCTAAT 
GCGCAGGCTGGCCCTGCCCTGGTTTCCTGCCCTGGGAGGCAGTGATGGGTTAGTGGATGGAAGGGGCTGACAGAC 
AGCCCTCCATCTAAACTGCCCCCTCTGCCCTGCGGGTCACAGGAGGGAGGGGGAAGGCAGGGAGGGCCTGGGCCC 
CAGTTGTATTTATTTAGTATTTATTCACTTC 

CCTGACCCCTGACCCCTCATAGCCCTCACCCTGX^CTAGGAAATCCAGGGTGGTGGTGATAGGTATAAGTGGTG 

TGTGTATGCGTGTGTGTGTGTGTGTGAAAATGTGTGTGTGCTTATGTATGAGGTACAACCTGTTCTGCTTTCCTC 

TTCCTGAATTTTATTTTTTGGGAAAAGAAAAGTCAAGGGTAGGGTGGGCCTTCAG 

TTTTTTTTTCTTTCTTTCTTTCTTT 

GCACAATCTCG<3CTCACTGCATCCTCCGCCTCC^ 

GGATTACAGGCTCCTGCCACCACGCCCAGCTAATTTTTGTTTTGTTTTGTTTGGAGACAGAGTCT 
ACCAGGGCTGXSAATGATTTCAGCTGACTGCAAC^ 

CGAGTAGCTGAGATTATAGGCACCTACCACCACGCCCGGCTAATTTTTGTATTTTTAGTAGAGACGG^ 

CATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTTAGGTGATCCACTCGCCTTCATCTCCCAAAGTGCTG^GATT 

ACAGGCGTGAGCCACCGTGCCTGGCCACGCCCAACTAATTTrc 

TGGCCAGGCTGCTCTTGAACTCCTGACCTCAGGTAATCGACCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGG 
TGTGAG C C ACCACG C C CGGT ACAT ATTTTTTAAATTG AATTCT ACT ATTT ATGTG ATC CTTTTGG AGT CAG ACAG 
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FTGIJKE 179B 

ATGTGGTTGCATCCTAACTCCATGTCTCTGAGCATTAGATTTCTCATTTGCCAATAATAATACCTCCCTTAGAAG 
TTTGTTGTGAGGATTAAATAATGTAAATAAAGAACTAGCATAACACTCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGAAA 
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FIGURE 180 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA73492 
xsuburiit 1 of 1, 837 aa, 1 stop 
><MW: 90167, pi: 8.39, NX(S/T): 1 

MSQTGSHPGRGLAGRWLWGAQPCLLLPIVPLSWLVWLLLLLLASLLPSARLASPIiPREEEIV 

FPEKLNGSVLPGSGAPARLLCRLQAFGETLLLELEQDSGVQVEGLTVQYLGQAPELLGGAEP 

GTYLTGTINGDPESVASLHWDGGALLGVLQYRGAELHLQPLEGGTPNSAGGPGAHILRRKSP 

ASGQGPMCNVKAPLGSPSPRPRRAKRFASLSRFVETLWADDKMAAFHGAGLKRYLLTVMAA 

AAKAFKHPSIRNPVSLWTRLVILGSGEEGPQVGPSAAQTLRSFCAWQRGLNTPEDSGPDHF 

DTAILFTRQDLCGVSTCTTLGMADVGTVCDPARSCAIVEDDGLQSAFTAAHELGHVFNMLHD 

NSKPCI SLNGPLSTSRHVMAPVMAHVDPEE PWS PCSARF ITDFLDNGYGHCLLDKPEAPLHL 

PVTFPGKDYDADRQCQLTFGPDSRHCPQLPPPCAALWCSGHLNGHAMCQTKHSPWADGTPCG 

PAQACMGGRCLHMDQLQDFNIPQAGGWGPWGPWGDCSRTCGGGVQFSSRDCTRPVPRNGGKY 

CEGRRTRFRSCNTEDCPTGSALTFREEQCAAYNHRTDLFKSFPGPMDWVPRYTGVAPQDQCK 

LTCQARALGYYYVLEPR WDGTPCS PDS SS VCVQGRC I HAGCDR 1 1 GS KKKFDKCMVCGGDG 

SGCSKQSGSFRKFRYGYNNWTIPAGATHILVRQQGNPGHRSIYLALKLPDGSYALNGEYTL 

MPSPTDWLPGAVSLRYSGATAASETLSGHGPLAQPLTLQVLVAGNPQDTRLRYSFFVPRPT 

PSTPRPTPQDWliHRRAQILEILRRRPWAGRK 

Important features of the protein: 

Signal peptide: 

amino acids 1-48 

N-glycosylation site. 

amino acids 68-71 

Glycosaminoglycan attachment site 
amino acids 188-191, 772-775 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 182-185 

Tyrosine kinase phosphorylation site. 

amino acids 730-736 

N-myristoylation sites. 

amino acids 5-10, 19-24, 121-126, 125-130, 130-135, 147-152, 167- 

172, 168-173, 174-179, 323-328, 352-357, 539-544, 555-560, 577- 

582, 679-684, 682-687, 763-768 

Ami da t ion sites. 

amino acids 560-563, 834-837 

Leucine zipper pattern. 

amino acids 17-38, 24-45 

Neutral zinc metallopeptidas s, zinc-binding region signature, 
amino acids 358-367 
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FIGURE 181 

CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATGG 
CAAAGAATCCTCCAGAGAATTGTGAAGACTGTCACATTCTAAATGCAGAAGCTTTTAAATCC 
AAGAAAATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCT 
AATTGTCCTGTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACA 
TGGAGCACACTTTCTACAGCAATGGAGAGAAGAAGAAGATTTACATGGAAATTGATCCTGTG 
ACCAGAACTGAAATATTCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACTT 
TAAAAACGGATACACTGGCATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGA 
TTAAAGTGATTCCTGAATTTTCTGAACCAGAAGAGGAAATAGATGAGAATGAAGAAATTACC 
ACAACTTTCTTTGAACAGTCAGTGATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGA 
TTTTCTTAAAAATTCCAAAATTCTGGAGATTTGTGATAACGTGACCATGTATTGGATCAATC 
CCACTCTAATATCAGTTTCTGAGTTACAAGACTTTGAGGAGGAGGGAGAAGATCTTCACTTT 
CCTGCCAACGAAAAAAAAGGGATTGAACAAAATGAACAGTGGGTGGTCCCTCAAGTGAAAGT 
AGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGAAGAACTTCCAATAAATGACTATACTG 
AAAATGGAATAGAATTTGATCCCATGCTGGATGAGAGAGGTTATTGTTGTATTTACTGCCGT 
CGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTAGGCTACTACCCATATCCATA 
CTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTGTAACTGGTGGGTGGCCC 
GCATGCTGGGGAGGGTC TAA TAGGAGGTTTGAGCTCAAATGCTTAAACTGCTGGCAACATAT 
AATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCTGGTAGCCAGCT 
CTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACATTATCACC 
AAAAAAAAAAAAAAAAAAA 
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FTGTJRE 182 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA73727 
xsubunit 1 of 1, 317 aa, 1 stop 
><MW: 37130, pi: 5.18, NX(S/T): 3 

MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAY 
DMEHTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKT 
QIKVIPEFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWI 
NPTLISVSELQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDY 
TENGIEFDPMLDERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWV 
ARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-4 0 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C- terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme -binding site signature. 

amino acids 9-14 

/g5/27£) 
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FIGURE 183 

GCGGAACTGGCTCCGGCTGGCACCTGAGGAGCGGCGTGACCCCGAGGGCCCAGGGAGCTGCC 
CGGCTGGCCTAGGCAGGCAGCCGCACCATGGCCAGCACGGCCGTGCAGCTTCTGGGCTTCCT 
GCTCAGCTTCCTGGGCATGGTGGGCACGTTGATCACCACCATCCTGCCGCACTGGCGGAGGA 
CAGCGCACGTGGGCACCAACATCCTCACGGCCGTGTCCTACCTGAAAGGGCTCTGGATGGAG 
TGTGTGTGGCACAGCACAGGCATCTACCAGTGCCAGATCTACCGATCCCTGCTGGCGCTGCC 
CCAAGACCTCCAGGCTGCCCGCGCCCTCATGGTCATCTCCTGCCTGCTCTCGGGCATAGCCT 
GCGCCTGCGCCGTCATCGGGATGAAGTGCACGCGCTGCGCCAAGGGCACACCCGCCAAGACC 
ACCTTTGCCATCCTCGGCGGCACCCTCTTCATCCTGGCCGGCCTCCTGTGCATGGTGGCCGT 
CTCCTGGACCACCAACGACGTGGTGCAGAACTTCTACAACCCGCTGCTGCCCAGCGGCATGA 
AGTTTGAGATTGGCCAGGCCCTGTACCTGGGCTTCATCTCCTCGTCCCTCTCGCTCATTGGT 
GGCACCCTGCTTTGCCTGTCCTGCCAGGACGAGGCACCCTACAGGCCCTACCAGGCCCCGCC 
CAGGGCCACCACGACCACTGCAAACACCGCACCTGCCTACCAGCCACCAGCTGCCTACAAAG 
ACAATCGGGCCCCCTCAGTGACCTCGGCCACGCACAGCGGGTACAGGCTGAACGACTACGTG 
TGAGTCCCCACAGCCTGCTTCTCCCCTGGGCTGCTGTGGGCTGGGTCCCCGGCGGGACTGTC 
AATGGAGGCAGGGGTTCCAGCACAAAGTTTACTTCTGGGCAATTTTTGTATCCAAGGAAATA 
ATGTGAATGCGAGGAAATGTCTTTAGAGCACAGGGACAGAGGGGGAAATAAGAGGAGGAGAA 
AGCTCTCTATACCAAAGACTGAAAAAAAAAATCCTGTCTGTTTTTGTATTTATTATATATAT 
TTATGTGGGTGATTTGATAACAAGTTTAATATAAAGTGACTTGGGAGTTTGGTCAGTGGGGT 
TGGTTTGTGATCCAGGAATAAACCTTGCGGATGTGGCTGTTTATGAAAAAAAAAAAAA 
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FIGURE 184 

MASTAVQLLGFLLSFLGMVGTLITTILPHWRRTAHVGTNILTAVSYLKGLWMECVWHSTGIY 
QCQIYRSLLALPQDLQAARALMVISCLLSGIACACAVIGMKCTRCAKGTPAKTTFAILGGTL 
FILAGLLCMVAVSWTTNDWQNFYNPLLPSGMKFEIGQALYLGFISSSLSLIGGTLLCLSCQ 
DEAPYRPYQAPPRATTTTANTAPAYQPPAAYKDNRAPSVTSATHSGYRLNDYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domains: 

amino acids 82-103, 115-141, 160-182 
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FIGURE 185 

GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCG 
CAGGCGGCAGGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCT 
CCTGTCCATCCTGGGGCTGGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCC 
AGGACCTGTACGACAACCCCGTCACCTCCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGC 
GTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCCTATTTCACCATCCTGGGACTTCCAGC 
CATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCTGGGTGCCATTGGCCTCC 
TGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTGCCAAAGCC 
AACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGT 
GTCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCG 
GCATGGGTGGGATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTG 
GGCTGGGTCGCTGGAGGCCTCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGG 
CCTGGCACCAGAAGAAACCAACTACAAAGCCGTTTCTTATCATGCCTCAGGCCACAGTGTTG 
CCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTTTGGGTCCAACACCAAAAACAAGAAG 
AT ATACGATGGAGGTGC C CGCACAGAGGACGAGGTACAATCTTATCCTTC CAAG CACGACTA 
TGTGTAATGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCACCCAAAA 
AACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGC 
CTCGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCC 
ACCATAAAACAGCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTAT 
TTCTTTTTTTAAATATAACTTTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCAC 
ATTTTGATGATTTAGACAGACTCCCCCTCTTCCTCCTAGTCAATAAACCCATTGATGATCTA 
TTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAAGAGTAGACCCAAAGATGTTATTTT 
CTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAAACACTTACTGAAGAA 
GAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTTACACTG 
TGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTG 
CTGTTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAG 
TCCTCTTTCTGTCGCGGGTCAGAAATTGTCCCTAGATGAATGAGAAAATTATTTTTTTTAAT 
TTAAGTCCTAAATATAGTTAAAATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGA 
AATAGCCTCACCCCTACATGTGGATAGAAGGAAATGAAAAAATAATTGCTTTGACATTGTCT 
ATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCCATGAAAAGCTCACACCTGTAATC 
CTAGCACTTTGGGAGGCTGAGGAGGAAGGATCACTTGAGCCCAGAAGTTCGAGACTAGCCTG 
GGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCATGGTG 
GCATACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGG 
AGGTTGGGGCTGCAGTGAGCCATGATCACACCACTGCACTCCAGCCAGGTGACATAGCGAGA 
TCCTGTCTAAAAAAATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAA 
ACTAATTCTTTAA 
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FIGURE 186 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA73734 
xsubunit l of l, 261 aa, l stop 
><MW: 27856, pi: 8.50, NX(S/T): 1 

MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTE 
CRPYFT I LGLPAMLQAVRALM I VG I VLGAI GLLVS I FALKCI RI GSMEDS AKANMTLTSGI M 
F I VSGLCAI AGVSVFANMLVTNFWMSTANM YTGMGGMVQ WQTRYTFGAALFVGWVAGGLTL 
IGGVMMCIACRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTE 
DEVQSYPSKHDYV 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FTGTJRE 187 

GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCG 
GAGTCCAGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTG 
GGCTGTTTCTTGGTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGG 
AGAGTGTCGGCCTTCATTGAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTG 
GATGAATTGCGTGAGGCAGGCTAACATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGG 
CTCTTTCTCCGGACCTACAGGCAGCCAGAGGACTGATGTGTGCTGCTTCCGTGATGTCCTTC 
TTGGCTTTCATGATGGCCATCCTTGGCATGAAATGCACCAGGTGCACGGGGGACAATGAGAA 
GGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCATCATCACGGGCATGGTGGTGC 
TCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATAACTCAATAGTGAAT 
GTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCACTGGTGCT 
GATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTACA 
GATACTCGATAC CTTC CCAT CGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGT C ACCG 
AGCGTCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGC 
CATGCAAATGACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTA 
CTGTTCTTAACTGCCTAATCTTAATTACAGGAACTGTGCATCAGCTATTTATGATTCTATAA 
GCTATTTCAGCAGAATGAGATATTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAAT 
TTGTTTTCTAAGGTGGTTCAAGCATCTACTCTTTTTATCATTTACTTCAAAATGACATTGCT 
AAAGACTGCATTATTTTACTACTGTAATTTCTCCACGACATAGCATTATGTACATAGATGAG 
TGTAACATTTATATCTCACATAGAGACATGCTTATATGGTTTTATTTAAAATGAAATGCCAG 
TCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAAATCATGGATAGGGTTG 
AAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCATTTATAATGA 
AGATTAAAATGAAGGCTTTAATCAGCATTGTAAAGGAAATTGAATGGCTTTCTGATATGCTG 
TTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTT 
TTCTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCA 
AACTGCTTTTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATG 
GTTTTAGGAAAGTGAAAATATTTTTGTTTTTGTATTTGAAGAAGAATGATGCATTTTGACAA 
GAAATCATATATGTATGGATATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATC 
AATATAAATAAAAGAGCAGAAAAATATGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACA 
ACAAAAAAAGTTGTCCTTTGAGAACTTCACCTGCTCCTATGTGGGTACCTGAGTCAAAATTG 
TCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTACCATTTCTGTTTAGTTTTACTAAA 
ATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAAACTGACAATCCAATTTGA 
AAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTTTATACATTTATA 

TTAATAAATTGTACATTTTTCTAATT 
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FTGTTRR 188 



></usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA73 73 5 
xsubunit 1 of 1, 225 aa, 1 stop 
><MW: 24845, pi: 9.07, NX{S/T): 0 

^THALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCVRQANIRM 
QCKIYDSLl^LSPDLQAARGLMCAASVMSFLAFMMAILGMKCTRCTGDNEKVKAHILLTAGI 
IFI ITGMWLIPVSWVANAI IRDFYNSIVNVAQKRELGEALYLGWTTALVLIVGGALFCCVF 
CCNEKSSS YRYS I PSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 

Transmembrane domains : 

amino acids 82-101, 118-145, 164-188 




WO 00/12708 



PO7US99/201I1 



FIGURE 189 

TCGCC^TGGCCTCTGCCGGAATGCAGATCCTGGGAGTCGTCCTGACACTGCTGGGCTGGGTG 
AATGGCCTGGTCTCCTGTGCCCTGCCCATGTGGAAGGTGACCGCTTTCATCGGCAACAGCAT 
CGTGGTGGCCCAGGTGGTGTGGGAGGGCCTGTGGATGTCCTGCGTGGTGCAGAGCACCGGCC 
AGATGCAGTGCAAGGTGTACGACTCACTGCTGGCGCTGCCACAGGACCTGCAGGCTGCACGT 
GCCCTCTGTGTCATCGCCCTCCTTGTGGCCCTGTTCGGCTTGCTGGTCTACCTTGCTGGGGC 
CAAGTGTACCACCTGTGTGGAGGAGAAGGATTCCAAGGCCCGCCTGGTGCTCACCTCTGGGA 
TTGTCTTTGTCATCTCAGGGGTCCTGACGCTAATCCCCGTGTGCTGGACGGCGCATGCCATC 
ATCCGGGACTTCTATAACCCCCTGGTGGCTGAGGCCCAAAAGCGGGAGCTGGGGGCCTCCCT 
CTACTTGGGCTGGGCGGCCTCAGGCCTTTTGTTGCTGGGTGGGGGGTTGCTGTGCTGCACTT 
GCCCCTCGGGGGGGTCCCAGGGCCCCAGCCATTACATGGCCCGCTACTCAACATCTGCCCCT 
GCCATCTCTCGGGGGCCCTCTGAGTACCCTACCAAGAATTACGTCTGACGTGGAGGGGAATG 
GGGGCTCCGCTGGCGCTAGAGCCATCCAGAAGTGGCAGTGCCCAACAGCTTTGGGATGGGTT 
CGTACCTTTTGTTTCTGCCTCCTGCTATTTTTCTTTTGACTGAGGATATTTAAAATTCATTT 
GAAAACTGAGCCAAGGTGTTGACTCAGACTCTCACTTAGGCTCTGCTGTTTCTCACCCTTGG 
ATGATGGAGCCAAAGAGGGGATGCTTTGAGATTCTGGATCTTGACATGCCCATCTTAGAAGC 
CAGTCAAGCTATGGAACTAATGCGGAGGCTGCTTGCTGTGCTGGCTTTGCAACAAGACAGAC 
TGTCCCCAAGAGTTCCTGCTGCTGCTGGGGGCTGGGCTTCCCTAGATGTCACTGGACAGCTG 
CCCCCCATCCTACTCAGGTCTCTGGAGCTCCTCTCTTCACCCCTGGAAAAACAAATCATCTG 
TTAACAAAGGACTGCCCACCTCCGGAACTTCTGACCTCTGTTTCCTCCGTCCTGATAAGACG 
TCCACCCCCCAGGGCCAGGTCCCAGCTATGTAGACCCCCGCCCCCACCTCCAACACTGCACC 
CTTCTGCCCTGCCCCCCTCGTCTCACCCCCTTTACACTCACATTTTTATCAAATAAAGCATG 
TTTTGTTAGTGCA 



WO 00/12708 



PCT/US99/20U1 



FIGURE 190 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA73736 
xsubunit 1 of 1, 220 aa, 1 stop 
><MW: 23292, pi: 8.43, NX(S/T): 0 
MASAGMQILGWLTLLGWVNGLVSCALPMWKVTAF^^ 
QCKVYDSLLALPQDLQAARALCVIALLVALFGL^ 

FVI SGVLTLI PVCWTAHAI I RDFYNPLVAEAQKRELGASLYLGWAASGLLLLGGGLLCCTCP 
SGGSQGPSHYMARYSTSAPAISRGPSEYPTKNYV 

Transmembrane domains: 

amino acids 8-30 (type II), 82-102, 121-140, 166-186 




WO 00/12708 



PCTAJS99/201H 



FIGURE 191 

GC CAAGGAGAACATCATCAAAGACTTCT CTAGACTCAAAAGGCTTC CACGTT CTAC ATCTTG 
AGCATCTTCTACCACTCCGAATTGAACCAGTCTTCAAAGTAAAGGCAATGGCATTTTATCCC 
TTGCAAATTGCTGGGCTGGTTCTTGGGTTCCTTGGCATGGTGGGGACTCTTGCCACAACCCT 
TCTGCCTCAGTGGTGGAGTATCAGCTTTTGTTGGCAGCAACATTATTGTCTTTGAGAGGCTC 
TGGGAAGGGCTCTGGATGAATTGCATCCGACAAGCCAGGGTCCGGTTGCAATGCAAGTTCTA 
TAGCTCCTTGTTGGCTCTCCCGCCTGCCCTGGAAACAGCCCGGGCCCTCATGTGTGTGGCTG 
TTGCTCTCTCCTTGATCGCCCTGCTTATTGGCATCTGTGGCATGAAGCAGGTCCAGTGCACA 
GGCTCTAACGAGAGGGCCAAAGCATACCTTCTGGGAACTTCAGGAGTCCTCTTCATCCTGAC 
GGGTATCTTCGTTCTGATTCCGGTGAGCTGGACAGCCAATATAATCATCAGAGATTTCTACA 
ACCCAGCCATCCACATAGGTCAGAAACGAGAGCTGGGAGCAGCACTTTTCCTTGGCTGGGCA 
AGCGCTGCTGTCCTCTTCATTGGAGGGGGTCTGCTTTGTGGATTTTGCTGCTGCAACAGAAA 
GAAGCAAGGGTACAGATATCCAGTGCCTGGCTACCGTGTGCCACACACAGATAAGCGAAGAA 
ATACGACAATGCTTAGTAAGACCTCCACCAGTTATGTC TAA TGCCTCCTTTTGGCTCCAAGT 
ATGGACTATGGTCAATGTTTTTTATAAAGTCCTGCTAGAAACTGTAAGTATGTGAGGCAGGA 
GAACTTGCTTTATGTCTAGATTTACATTGATACGAAAGTTTCAATTTGTTACTGGTGGTAGG 
AATGAAAATGACTTACTTGGACATTCTGACTTCAGGTGTATTAAATGCATTGACTATTGTTG 
GACCCAATCGCTGCTCCAATTTTCATATTCTAAATTCAAGTATACCCATAATCATTAGCAAG 
TGTACAATGATGGACTACTTATTACTTTTTGACCATCATGTATTATCTGATAAGAATCTAAA 
GTTGAAATTGATATTCTATAACAATAAAACATATACCTATTCTA 



WO 00/12708 
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FIGURE 192 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA73737 
xsubunit 1 of 1, 173 aa, 1 stop 
><MW: 18938, pi: 9.99, NX(S/T): 1 

MNCIRQARVRLQCKFYSSLLALPPALETARALMCVAVALSLI ALLI G I CGMKQVQCTGSNER 
AKAYLLGTSG VLF I LTG I FVLI PVSWTANI 1 1 RDFYNP AIH I GQKRELGAALFLGWAS AAVL 
F I GGGLLCGF CCCNRKKQGYRYP VPGYRVPHTDKRRNTTMLS KTS TS YV 

Important features of the protein: 
Transmembrane domains : 

amino acids 31-51, 71-90, 112-133 

N-glycosylation site. 

amino acids 161-164 



/9S/27D 



WO 00/12708 



PCT/US99/201U 



FTGURE 193 

AGTGACAATCTCAGAGCAGCTTCTACACCACAGCCATTTCCAGC43CGAAGATCACTGGGGGT 
CTCCTTCTGCTCTGTACAGTGGTCTATTTCTGTAGCAGCTCAGAAGCTGCTAGTCTGTCTCC 
AAAAAAAGTGGACTGCAGCATTTACAAGAAGTATCCAGTGGTGGCCATCCCCTGCCCCATCA 
CATACCTACCAGTTTGTGGTTCTGACTACATCACCTATGGGAATGAATGTCACTTGTGTACC 
GAGAGCTTGAAAAGTAATGGAAGAGTTCAGTTTCTTCACGATGGAAGTTGCTAAATTCTCCA 
TGGACATAGAGAGAAAGGAATGATATTCTCATCATCATCTTCATCATCCCAGGCTCTGACTG 
AGTTTCTTTCAGTTTTACTGATGTTCTGGGTGGGGGACAGAGCCAGATTCAGAGTAATCTTG 
ACTGAATGGAGAAAGTTTCTGTGCTACCCCTACAAACCCATGCCTCACTGACAGACCAGCAT 
TTTTTTTTTAACACGTCAATAAAAAAATAATCTCC CAGA 



WO 00/12708 



PCT/US99/201I1 



FIGURE 194 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA7373 9 
xsubunit 1 of 1, 85 aa, 1 stop 
><MW: 9232, pi: 7.94, NX(S/T): 0 

MKITGGLLLLCTWYFCSSSEAASLSPKKVDCSIYKKYPWAIPCPITYLPVCGSDYITYGN 
ECHLCTESLKSNGRVQFLHDGSC 

Signal peptide: 

amino acids 1-19 



/ 9 



WO 00/12708 



PCT/US99/20111 



FTGIJRE 195 

CCCGCGCCCGGTTCTCCCTCGCAGCACCTCGAAGTGCGCCCCTCGCCCTCCTGCTCGCGCCC 
CGCCGCCATGGCTGCCTCCCCCGCGCGGCCTGCTGTCCTGGCCCTGACCGGGCTGGCGCTGC 
TCCTGCTCCTGTGCTGGGGCCCAGGTGGCATAAGTGGAAATAAACTCAAGCTGATGCTTCAA 
AAACGAGAAGCACCTGTTCCAACTAAGACTAAAGTGGCCGTTGATGAGAATAAAGCCAAAGA 
ATTCCTTGGCAGCCTGAAGCGCCAGAAGCGGCAGCTGTGGGACCGGACTCGGCCCGAGGTGC 
AGCAGTGGTACCAGCAGTTT CT CTAC ATGGGCTTTGATGAAG CGAAATTTGAAGATGACATC 
ACCTATTGGCTTAACAGAGATCGAAATGGACATGAATACTATGGCGATTACTACCAACGTCA 
CTATGATGAAGACTCTGCAATTGGTCCCCGGAGCCCCTACGGCTTTAGGCATGGAGCCAGCG 
TCAACTACGATGACTACTAACCATGACTTGCCACACGCTGTACAAGAAGCAAATAGCGATTC 
TCTTCATGTATCTCCTAATGCCTTACACTACTTGGTTTCTGATTTGCTCTATTTCAGCAGAT 
CTTTTCTACCTACTTTGTGTGATCAAAAAAGAAGAGTTAAAACAACACATGTAAATGCCTTT 
TGATATTTCATGGGAATGCCTCT CATTTAAAAATAGAAATAAAG CATTTTGTTAAAAAGA 



ml*™ 



WO 00/12708 
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FIGURE 196 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA73742 
xsubunit 1 of 1 ( 148 aa, 1 stop 
><MW: 17183, pi: 8.77, NX(S/T): 0 

MAAS PARPAVLALTGLALLLLLCWGPGG I SGNKLKLMLQKRE AP VPTKTKVAVDENKAKEFL 
GSLKRQKRQLWDRTRPEVQQWYQQFLYMGFDEAKFEDDITYWLNRDRNGHEYYGDYYQRHYD 
EDSAIGPRSPYGFRHGASVNYDDY 

Signal peptide: 

amino acids 1-3 0 



; 9T/X70 



WO 00/12708 



PCIYIjS99/20m 



FIGURE 197 



CGGCTCGAGCCCGCCCGGAAGTGCCCGAGGGGCCGCGATGGAGCTGGGGGAGCCGGGCGCTC 
GGTAGCGCGGCGGGCAAGGCAGGCGCCATGACCCTGATTGAAGGGGTGGGTGATGAGGTGAC 
CGTCCTTTTCTCGGTGCTTGCCTGCCTTCTGGTGCTGGCCCTTGCCTGGGTCTCAACGCACA 
CCGCTGAGGGCGGGGACCCACTGCCCCAGCCGTCAGGGACCCCAACGCCATCCCAGCCCAGC 
GCAGCCATGGCAGCTACCGACAGCATGAGAGGGGAGGCCCCAGGGGCAGAGACCCCCAGCCT 
GAGACACAGAGGTCAAGCTGCACAGCCAGAGCCCAGCACGGGGTTCACAGCAACACCGCCAG 
CCCCGGACTCCCCGCAGGAGCCCCTCGTGCTACGGCTGAAATTCCTCAATGATTCAGAGCAG 
GTGGCCAGGGCCTGGCCCCACGACACCATTGGCTCCTTGAAAAGGACCCAGTTTCCCGGCCG 
GGAACAGCAGGTGCGACTCATCTACCAAGGGCAGCTGCTAGGCGACGACACCCAGACCCTGG 
GCAGCCTTCACCTCCCTCCCAACTGCGTTCTCCACTGCCACGTGTCCACGAGAGTCGGTCCC 
CCAAATCCCCCCTGCCCGCCGGGGTCCGAGCCCGGCCCCTCCGGGCTGGAAATCGGCAGCCT 
GCTGCTGCCCCTGCTGCTCCTGCTGTTGCTGCTGCTCTGGTACTGCCAGATCCAGTACCGGC 
CCTTCTTTCCCCTGACCGCCACTCTGGGCCTGGCCGGCTTCACCCTGCTCCTCAGTCTCCTG 
GCCTTTGCCATGTACCGCCC GTAG TGCCTCCGCGGGCGCTTGGCAGCGTCGCCGGCCCCTCC 
GGACCTTGCTCCCCGCGCCGCGGCGGGAGCTGCTGCCTGCCCAGGCCCGCCTCTCCGGCCTG 
CCTCTTCCCGCTGCCCTGGAGCCCAGCCCTGCGCCGCAGAGGACTCCCGGGACTGGCGGAGG 
CCCCGCCCTGCGACCGCCGGGGCTCGGGGCCACCTCCCGGGGCTGCTGAACCTCAGCCCGCA 
CTGGGAGTGGGCTCCTCGGGGTCGGGCATCTGCTGTCGCTGCCTCGGCCCCGGGCAGAGCCG 
GGCCGCCCCGGGGGCCCGTCTTAGTGTTCTGCCGGAGGACCCAGCCGCCTCCAATCCCTGAC 
AGCTCCTTGGGCTGAGTTGGGGACGCCAGGTCGGTGGGAGGCTGGTGAAGGGGAGCGGGGAG 
GGGCAGAGGAGTTCCCCGGAACCCGTGCAGATTAAAGTAACTGTGAAGTTTTAAAAAAAAAA 
AAAAAAAA 
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FIGURE 198 

MTLIEGVGDEVTVLFSVLACLLVLALAWSTHTAEGGDPLPQPSGTPTPSQPSAAMAATDSM 
RGEAPGAETPSLRHRGQAAQPEPSTGFTATPPAPDSPQEPLVLRLKFLNDSEQVARAWPHDT 
IGSLKRTQFPGREQQVRLIYQGQLLGDDTQTLGSLHLPPNCVLHCHVSTRVGPPNPPCPPGS 
EPGPSGLEIGSLLLPLLLLLLLLLWYCQIQYRPFFPLTATLGLAGFTLLLSLLAFAMyRP 

Signal peptide: 

amino acids 1-31 

Transmembrane domain: 

amino acids 195-217 



WO 00/12708 



PCT/US99/20111 



FIGURE 199 

GAGATTGGAAACAGCCAGGTTGGAGCAGTGAGTGAGTAAGGAAACCTGGCTGCCCTCTCCAG 
ATTCCCCAGGCTCTCAGAGAAGATCAGCAGAAAGTCTGCAAGACCCTAAGAACCATCAGCCC 
TCAGCTGCACCTCCTCCCCTCCAAGGATGACAAAGGCGCTACTCATCTATTTGGTCAGCAGC 
TTTCTTGCCCTAAATCAGGCCAGCCTCATCAGTCGCTGTGACTTGGCCCAGGTGCTGCAGCT 
GGAGGACTTGGATGGGTTTGAGGGTTACTCCCTGAGTGACTGGCTGTGCCTGGCTTTTGTGG 
AAAGCAAGTTCAACATATCAAAGATAAATGAAAATGCGGATGGAAGCTTTGACTATGGCCTC 
TTCCAGATCAACAGCCACTACTGGTGCAACGATTATAAGAGTTACTCGGAAAACCTTTGCCA 
CGTAGACTGTCAAGATCTGCTGAATCCCAACCTTCTTGCAGGCATCCACTGCGCAAAAAGGA 
TTGTGTCCGGAGCACGGGGGATGAACAACTGGGTAGAATGGAGGTTGCACTGTTCAGGCCGG 
CCACTCTCCTACTGGCTGACAGGATGCCGCCTGAGATGAAACAGGGTGCGGGTGCACCGTGG 
AGTCATTCCAAGACTCCTGTCCTCACTCAGGGATTCTTCATTTCTTCTTCCTACTGCCTCCA 
CTTCATGTTATTTTCTTCCCTTCCCATTTACAACTAAAACTGACCAGAGCCCCAGGAATAAA 
TGGTTTTCTTGGCTTCCTCCTTACTCCCATCTGGACCCAGTCCCCTGGTTCCTGTCTGTTAT 
TTGTAAACTGAGGACCACAATAAAGAAATCTTTATATTTATCG 



WO 00/12708 
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FIGURE 200 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA7374 6 
xsubunit l of l, 148 aa, 1 stop 
><MW: 16896, pi: 6.05 # NX(S/T): 1 

MTKALLIYLVSSFIoALNQASLISRCDLAQVLQLEDLDGFEGYSLSDWLCLAFVESKFNISKI 
NENADGSFDYGLFQINSHYWCNDYKSYSENLCHVDCQDLLNPNLLAGIHCAKRIVSGARGMN 
NWVEWRLHCSGRPLSYWLTGCRLR 

Signal peptide: 

amino acids 1-18 



WO 00/12708 
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FIGURE 201 

TCTGACCTGACTGGAAGCGTCCAAAGAGGGACGGCTGTCAGCCCTGCTTGACTGAGAACCCA 

CCAGCTCATCCCAGACACCTCATAGCAACCTATTTATACAAAGGGGGAAAGAAACACCTGAG 

CAGAATGGAATCATTATTTTTTTCCCAAGGAGAAAACCGGGGTAAAGGGAGGGAAGCAATTC 

AATTTGAAGTCCCTGTGAATGGGCTTTCAGAAGGCAATTAAAGAAATCCACTCAGAGAGGAC 

TTGGGGTGAAACTTGGGTCCTGTGGTTTTCTGATTGTAAGTGGAAGCAGGTCTTGCACACGC 

TGTTGGCAAATGTCAGGACCAGGTTAAGTGACTGGCAGAAAAACTTCCAGGTGGAACAAGCA 

ACCCATGTTCTGCTGCAAGCTTGAAGGAGCCTGGAGCGGGAGAAAGCTAACTTGAACATGAC 

CTGTTGCATTTGGCAAGTTCTAGCAACATGCTCCTAAGGAAGCGATACAGGCACAGACCATG 

CAGACTCCAGTTCCTCCTGCTGCTCCTGATGCTGGGATGCGTCCTGATGATGGTGGCGATGT 

TGCACCCTCCCCACCACACCCTGCACCAGACTGTCACAGCCCAAGCCAGCAAGCACAGCCCT 

GAAGCCAGGTACCGCCTGGACTTTGGGGAATCCCAGGATTGGGTACTGGAAGCTGAGGATGA 

GGGTGAAGAGTACAGCCCTCTGGAGGGCCTGCCACCCTTTATCTCACTGCGGGAGGATCAGC 

TGCTGGTGGCCGTGGCCTTACCCCAGGCCAGAAGGAACCAGAGCCAGGGCAGGAGAGGTGGG 

AGCTACCGCCTCATCAAGCAGCCAAGGAGGCAGGATAAGGAAGCCCCAAAGAGGGACTGGGG 

GGCTGATGAGGACGGGGAGGTGTCTGAAGAAGAGGAGTTGACCCCGTTCAGCCTGGACCCAC 

GTGGCCTCCAGGAGGCACTCAGTGCCCGCATCCCCCTCCAGAGGGCTCTGCCCGAGGTGCGG 

CACCCACTGTGTCTGCAGCAGCACCCTCAGGACAGCCTGCCCACAGCCAGCGTCATCCTCTG 

TTTCCATGATGAGGCCTGGTCCACTCTCCTGCGGACTGTACACAGCATCCTCGACACAGTGC 

CCAGGGCCTTCCTGAAGGAGAT CATCCT CGTGGACGAC CT CAGCCAGCAAGGAC AACTCAAG 

TCTGCTCTCAGCGAATATGTGGCCAGGCTGGAGGGGGTGAAGTTACTCAGGAGCAACAAGAG 

GCTGGGTGCCATCAGGGCCCGGATGCTGGGGGCCACCAGAGCCACCGGGGATGTGCTCGTCT 

TCATGGATGCCCACTG CG AGTG C CACCCAGGCTGGCTGGAGCCCCT CCTCAGCAGAATAGCT 

GGTGACAGGAGCCGAGTGGTATCTCCGGTGATAGATGTGATTGACTGGAAGACTTTCCAGTA 

TTACCCCTCAAAGGACCTGCAGCGTGGGGTGTTGGACTGGAAGCTGGATTTCCACTGGGAAC 

CTTTGCCAGAGCATGTGAGGAAGGCCCTCCAGTCCCCCATAAGCCCCATCAGGAGCCCTGTG 

GTGCCCGGAGAGGTGGTGGCCATGGACAGACATTACTTCCAAAACACTGGAGCGTATGACTC 

TCTTATGTCGCTGCGAGGTGGTGA7VAACCTCGAACTGTCTTTCAAGGCCTGGCTCTGTGGTG 

GCTCTGTTGAAATCCTTCCCTGCTCTCGGGTAGGACACATCTACCAAAATCAGGATTCCCAT 

TCCCCCCTCGACCAGGAGGCCACCCTGAGGAACAGGGTTCGCATTGCTGAGACCTGGCTGGG 

GTCATTCAAAGAAACCTTCTACAAGCATAGCCCAGAGGCCTTCTCCTTGAGCAAGGCTGAGA 

AGCCAGACTGCATGGAACGCTTGCAGCTGCAAAGGAGACTGGGTTGTCGGACATTCCACTGG 

TTTCTGGCTAATGTCTACCCTGAGCTGTACCCATCTGAACCCAGGCCCAGTTTCTCTGGAAA 

GCTCCACAACACTGGACTTGGGCTCTGTGCAGACTGCCAGGCAGAAGGGGACATCCTGGGCT 

GTCCCATGGTGTTGGCTCCTTGCAGTGACAGCCGGCAG CAACAGTACCTGCAGCACACCAG C 

AGGAAGGAGATTCACTTTGGCAGCCCACAGCACCTGTGCTTTGCTGTCAGGCAGGAGCAGGT 

GATTCTTCAGAACTGCACGGAGGAAGGCCTGGCCATCCACCAGCAGCACTGGGACTTCCAGG 

AGAATGGGATGATTGTCCACATTCTTTCTGGGAAATGCATGGAAGCTGTGGTGCAAGAAAAC 

AATAAAGATTTGTACCTGCGTCCGTGTGATGGAAAAGCCCGCCAGCAGTGGCGATTTGACCA 

GATAAATGCTGTGGATGAACG ATGAA TGTCAATGTCAGAAGGAAAAGAGAATTTTGGCCATC 

AAAATCCAGCTC CAAGTGAACGTAAAGAGCTTAT ATATTT CATGAAGCTGATCCTTTTGTGT 

GTGTGCTCCTTGTGTTAGGAGAGAAAAAAGCTCTATGAAAGAATATAGGAAGTTTCTCCTTT 

TCACACCTTATTTCATTGACTGCTGGCTGCTTA 



WO 00/12708 
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FIGURE 202 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA73760 
xsubunit 1 of 1, 639 aa, 1 stop 
><MW: 73063, pi: 6.84, NX(S/T): 2 

MLLRKRYRHRPCRLQFLLLLLMLGCVLMMVAMLHPPHHTLHQTVTAQASKHSPEARYRLDFG 
E SQDWVLE AEDEGEE YS PLEGLPP F I SLREDQLLVAVALPQARRNQSQGRRGGS YRL I KQ PR 
RQDKEAPKRDWGADEDGEVSEEEELTPFSLDPRGLQEALSARIPLQRALPEVRHPLCLQQHP 
QDSLPTASVILCFHDEAWSTLLRTVHS I LDTVPRAFLKE 1 1 LVDDLSQQGQLKS ALSEYVAR 
LEGVKLLRSNKRLGAIRARMLGATRATGDVLVFMDAHCECHPGWLEPLLSRIAGDRSRWSP 
VI DVIDWKTFQY YPSKDLQRGVLDWKLDFHWEPLPEHVRKALQS P I SP I RSP WPGE WAMD 
RHYFQNTGAYDSLMSLRGGENLELSFKAWLCGGSVEILPCSRVGHIYQNQDSHSPLDQEATL 
RNRVRIAETWLGSFKETFYKHSPEAFSLSKAEKPDCMERLQLQRRLGCRTFHWFLANVYPEL 
YPSEPRPSFSGKiHNTGLGLCADCQAEGDILGCPMVLAPCSDSRQQQYLQHTSRKEIHFGSP 
QHLCFAVRQEQVILQNCTEEGLAIHQQHWDFQENGMIVHILSGKCMEAWQENNKDLYLRPC 
DGKARQQWRFDQ I NAVDER 

Signal peptide: 

amino acids 1-28 



WO 00/12708 
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FIGURE 203 

CGCCAAGCATGCAGTAAAGGCTGAAAATCTGGGTCACAGCTGAGGAAGACCTCAGACATGGA 
GTCCAGGATGTGGCCTGCGCTGCTGCTGTCCCACCTCCTCCCTCTCTGGCCACTGCTGTTGC 
TGCCCCTCCCACCGCCTGCTCAGGGCTCTTCATCCTCCCCTCGAACCCCACCAGCCCCAGCC 
CGCCCCCCGTGTGCCAGGGGAGGCCCCTCGGCCCCACGTCATGTGTGCGTGTGGGAGCGAGC 
ACCTCCACCAAGCCGATCTCCTCGGGTCCCAAGATCACGTCGGCAAGTCCTGCCTGGCACTG 
CACCCCCAGCCACCCCATCAGGCTTTGAGGAGGGGCCGCCCTCATCCCAATACCCCTGGGCT 
ATCGTGTGGGGTCCCACCGTGTCTCGAGAGGATGGAGGGGACCCCAACTCTGCCAATCCCGG 
ATTTCTGGACTATGGTTTTGCAGCCCCTCATGGGCTCGCAACCCCACACCCCAACTCAGACT 
CCATGCGAGGTGATGGAGATGGGCTTATCCTTGGAGAGGCACCTGCCACCCTGCGGCCATTC 
CTGTTCGGGGGCCGTGGGGAAGGTGTGGACCCCCAGCTCTATGTCACAATTACCATCTCCAT 
CATCATTGTTCTCGTGGCCACTGGCATCATCTTCAAGTTCTGCTGGGACCGCAGCCAGAAGC 
GACGCAGACCCTCAGGGCAGCAAGGTGCCCTGAGGCAGGAGGAGAGCCAGCAGCCACTGACA 
GACCTGTCCCCGGCTGGAGTCACTGTGCTGGGGGCCTTCGGGGACTCACCTACCCCCACCCC 
TGACCATGAGGAGCCCCGAGGGGGACCCCGGCCTGGGATGCCCCACCCCAAGGGGGCTCCAG 
CCTTCCAGTTGAACCGGTGAGGGCAGGGGCAATGGGATGGGAGGGCAAAGAGGGAAGGCAAC 
TTAGGTCTTCAGAGCTGGGGTGGGGGTGCCCTCTGGATGGGTAGTGAGGAGGCAGGCGTGGC 
CTCCCACAGCCCCTGGCCCTCCCAAGGGGGCTGGACCAGCTCCTCTCTGGGAGGCACCCTTC 
CTTCTCCCAGTCTCTCAGGATCTGTGTCCTATTCTCTGCTGCCCATAACTCCAACTCTGCCC 
TCTTTGGTTTTTTCTCATGCCACCTTGTCTAAGACAACTCTGCCCTCTTAACCTTGATTCCC 
CCTCTTTGTCTTGAACTTCCCCTTCTATTCTGGCCTACCCCTTGGTTCCTGACTGTGCCCTT 
TCCCTCTTCCTCTCAGGATTCCCCTGGTGAATCTGTGATGCCCCCAATGTTGGGGTGCAGCC 
AAGCAGGAGGCCAAGGGGCCGGCACAGCCCCCATCCCACTGAGGGTGGGGCAGCTGTGGGGA 
GCTGGGGCCACAGGGGCTCCTGGCTCCTGCCCCTTGCACACCACCCGGAAGACTCCCCAGCC 
CCACGGGCAATCCTATCTGCTCGCCCTCCTGCAGGTGGGGGCCTCACATATCTGTGACTTCG 
GGTCCCTGTCCCCACCCTTGTGCACTCACATGAAAGCCTTGCACACTCACCTCCACCTTCAC 
AGGCCATTTGCACACGCTCCTGCACCCTCTCCCCGTCCATACCGCTCCGCTCAGCTGACTCT 
CATGTTCTCTCGTCTCACATTTGCACTCTCTCCTTCCCACATTCTGTGCTCAGCTCACTCAG 
TGGTCAGCGTTTCCTGCACACTTTACCTCTCATGTGCGTTTCCCGGCCTGATGTTGTGGTGG 
TGTGCGGCGTGCTCACTCTCTCCCTCATGAACACCCACCCACCTCGTTTCCGCAGCCCCTGC 
GTGCTGCTCCAGAGGTGGGTGGGAGGTGAGCTGGGGGCTCCTTGGGCCCTCATCGGTCATGG 
TCTCGTCCCATTCCACACCATTTGTTTCTCTGTCTCCCCATCCTACTCCAAGGATGCCGGCA 
TCACCCTGAGGGCTCCCCCTTGGGAATGGGGTAGTGAGGCCCCAGACTTCACCCCCAGCCCA 
CTGCTAAAATCTGTTTTCTGACAGATGGGTTTTGGGGAGTCGCCTGCTGCACTACATGAGAA 
AGGGACTCCCATTTGCCCTTCCCTTTCTCCTACAGTCCCTTTTGTCTTGTCTGTCCTGGCTG 
TCTGTGTGTGTGCCATTCTCTGGACTTCAGAGCCCCCTGAGCCAGTCCTCCCTTCCCAGCCT 
CCCTTTGGGCCTCCCTAACTCCACCTAGGCTGCCAGGGACCGGAGTCAGCTGGTTCAAGGCC 
ATCGGGAGCTCTGCCTCCAAGTCTACCCTTCCCTTCCCGGACTCCCTCCTGTCCCCTCCTTT 
CCTCCCTCCTTCCTTCCACTCTCCTTCCTTTTGCTTCCCTGCCCTTTCCCCCTCCTCAGGTT 
CTTCCCTCCTTCTCACTGGTTTTTCCACCTTCCTCCTTCCCTTCTTCCCTGGCTCCTAGGCT 
GTGATATATATTTTTGTATTAT CTCTTTCTTCTT CTTGTGGTGATCAT CTTGAATTACTGTG 
GGATGTAAGTTTCAAAATTTTCAAATAAAGCCTTTGCAAGATAA 
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PO7US99/20111 



FIGURE 204 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA76393 
xsubunit 1 of 1, 243 aa, 1 stop 
><MW: 26266, pi: 8.43, NX(S/T): 1 

MRPQGPAAS PQRLRGLLLLLLLQLPAPS SASE I P KGKQKAQLRQRE WDLYNGMCLQGPAGV 
PGRDGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIA 
ECTFTKMRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMN 
STINIHRTSSVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRI I I'EELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 



WO 00/12708 



PCT/US99/20IH 



FTGTJRE 205 

GTTAACCAGCGCAGTCCTCCGTGCGTCCCGCCCGCCGCTGCCCTCACTCCCGGCCAGGATGG 
CATCCTGTCTGGCCCTGCGCATGGCGCTGCTGCTGGTCTCCGGGGTTCTGGCCCCTGCGGTG 
CTCACAGACGATGTTCCACAGGAGCCCGTGCCCACGCTGTGGAACGAGCCGGCCGAGCTGCC 
GTCGGGAGAAGGCCCCGTGGAGAGCACCAGCCCCGGCCGGGAGCCCGTGGACACCGGTCCCC 
CAGCCCCCACCGTCGCGCCAGGACCCGAGGACAGCACCGCGCAGGAGCGGCTGGACCAGGGC 
GGCGGGTCGCTGGGGCCCGGCGCTATCGCGGCCATCGTGATCGCCGCCCTGCTGGCCACCTG 
CGTGGTGCTGGCGCTCGTGGTCGTCGCGCTGAGAAAGTTTTCTGCCTCCTGAAGCGAATAAA 
GGGGCCGCGCCCGGCCGCGGCGCGACTCGGCAAAAAAAAAAAAAAA 
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FIGURE 206 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA763 98 
xsubunit 1 of 1, 121 aa, 1 stop 
><MW: 12073, pi: 4.11, NX{S/T): 0 

MASCLALRMALLLVSGVLAPAVLTDDVPQEPVPTLWNEPAELPSGEGPVESTSPGREPVDTG 
PPAPTVAPGPEDSTAQERLDQGGGSLGPGAIAAIVIAALLATCWLALVWALRKFSAS 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 91-110 

Glycosaminoglycan attachment site. 

amino acids 44-47 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 116-119 

N-myristoylation site. 

amino acids 91-96 



WO 00/12708 



PCT/US99/201I1 



FIGURE 207 

GGCCGTTGGTTGGTGCGCGGCTGAAGGGTGTGGCGCGAGCAGCGTCGTTGGTTGGCCGGCGG 

CGGGCCGGGACGGG CATGG CCCTGCTGCTGTGCCTGGTGTGCCTGACGGCGGCGCTGGCCCA 

CGGCTGTCTGCACTGCCACAGCAACTTCTCCAAGAAGTTCTCCTTCTACCGCCACCATGTGA 

ACTTCAAGTCCTGGTGGGTGGGCGACATCCCCGTGTCAGGGGCGCTGCTCACCGACTGGAGC 

GACGACACGATGAAGGAGCTGCACCTGG CCAT CCCCGCCAAG ATCACC CGGGAGAAGCTGGA 

CCAAGTGGCGACAGCAGTGTACCAGATGATGGATCAGCTGTACCAGGGGAAGATGTACTTCC 

CCGGGTATTTCCCCAACGAGCTGCGAAACATCTTCCGGGAGCAGGTGCACCTCATCCAGAAC 

GCCATCATCGAAAGGCACCTGGCACCAGGCAGCTGGGGAGGAGGGCAGCTCTCCAGGGAGGG 

ACCCAGCCTAGCACCTGAAGGATCAATGCCATCACCCCGCGGGGACCTCCCCTAAGTAGCCC 

CCAGAGGCGCTGGGAGTGTTGCCACCGCCCTCCCCTGAAGTTTGCTCCATCTCACGCTGGGG 

GTCAACCTGGGGACCCCTTCCCTCCGGGCCATGGACACACATACATGAAAACCAGGCCGCAT 

CGACTGTCAGCACCGCTGTGGCATCTTCCAGTACGAGACCATCTCCTGCAACAACTGCACAG 

ACTCGCACGTCGCCTGCTTTGGCTATAACTGCGAGTAGGGCTCAGGCATCACACCCACCCGT 

GCCAGGGCCCTACTGTCCCTGGGGTCCCAGGCTCTCCTTGGAGGGGGCTCCCCGCCTTCCAC 

CTGGCTGTCATCGGGTAGGGCGGGGCCGTGGGTTCAGGGGCGCACCACTTCCAAGCCTGTGT 

CCCACAGGTCCTCGGCGCAGTGGAAGTCAGCTGTCCAGGGCCTCCTGAACTACATAAATAAC 

TGGCACAAGTAAGTCCCCTCCTCAAACCAACACAGGCAGTGTGTGTATGTGAGCACCTCGTG 

GGTGAGTATGTGTGGGGCACAGGCTGGCTCCCTCAGCTCCCACGTCCTAGAGGGGCTCCCGA 

GGAGGTGGAACCTCAACCCAGCTCTGCGCAGGAGGCGGCTGCAGTCCTTTTCTCCCTCAAAG 

GTCTCCGACCCTCAGCTGGAGGCGGGCATCTTTCCTAAAGGGTCCCCATAGGGTCTGGTTCC 

ACCCCATCCCAGGTCTGTGGTCAGAGCCTGGGAGGGTTCCCTACGATGGTTAGGGGTGCCCC 

ATGGAGGGGCTGACTGCCCCACATTGCCTTTCAGACAGGACACGAGCATGAGGTAAGGCCGC 

CCTGACCTGGACTTCAGGGGGAGGGGGTAAAGGGAGAGAGGAGGGGGGCTAGGGGGTCCTCT 

AGATCAGTGGGGGCACTG CAGGTGGGGCTCTC CCTATACCTGGGACACCTGCTGGATGTCAC 

CTCTGCAACCACACCCATGTGGTGGTTTCATGAACAGACCACGCTCCTCTGCCTTCTCCTGG 

CCTGGGACACACAGAG CCACCCCGGC CTTGTGAGTGAC CC AGAGAAGGGAGGCCTCGGGAGA 

AGGGGTGCTCGTAAGCCAACACCAGCGTGCCGCGGCCTGCACACCCTTCGGACATCCCAGGC 

ACGAGGGTGTCGTGGATGTGGCCACACATAGGACCACACGTCCCAGCTGGGAGGAGAGGCCT 

GGGGCC CCCAGGGAGGGAGG CAGGGGGTGGGGGACATGGAGAGCTGAGGCAGCCTCGTCTCC 

CCGCAGCCTGGTATCGCCAGCCTTAAGGTGTCTGGAGCCCCCACACTTGGCCAACCTGACCT 

TGGAAGATGCTGCTGAGTGTCTCAAGCAGCACTGACAGCAGCTGGGCCTGCCCCAGGGCAAC 

GTGGGGGCGGAGACTCAGCTGGACAGCCCCTGCCTGTCACTCTGGAGCTGGGCTGCTGCTGC 

CTCAGGACCCCCTCTCCGACCCCGGACAGAGCTGAGCTGGCCAGGGCCAGGAGGGCGGGAGG 

GAGGGAATGGGGGTGGGCTGTGCGCAGCATCAGCGCCTGGGCAGGTCCGCAGAGCTGCGGGA 

TGTGATTAAAGTCCCTGATGTTTCTC 



WO 00/12708 



PCT/US99/20111 



FIGURE 208 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA76399 
xsubunit 1 of 1, 157 aa, 1 stop 
><MW: 17681, pi: 7.65, NX(S/T): 1 

MALLLCLVCLTAALAHGCLHCHSNFSKKFSFYRHHVNFKSWWVGDIPVSGALLTDWSDDTMK 
ELHI^IPAKITREKLDQVATAVYQMMDQLYQGKMYFPGYFPNELRNIFREQVHLIQNAIIER 
HLAPGSWGGGQLSREGPSLAPEGSMPSPRGDLP 

Signal peptide: 

amino acids 1-15 



WO 00/12708 



PCT/US99/20111 



FTGTJRE 209 

AGCAGGAGCAGGAGAGGGACA ATG GAAGCTGCCCCGTCCAGGTTCATGTTCCTCTTATTTCT 
CCTCACGTGTGAGCTGGCTGCAGAAGTTGCTGCAGAAGTTGAGAAATCCTCAGATGGTCCTG 
GTGCTGCCCAGGAACCCACGTGGCTCACAGATGTCCCAGCTGCCATGGAATTCATTGCTGCC 
ACTGAGGTGGCTGTCATAGGCTTCTTCCAGGATTTAGAAATACCAGCAGTGCCCATACTCCA 
TAGCATGGTGCAAAAATTCCCAGGCGTGTCATTTGGGATCAGCACTGATTCTGAGGTTCTGA 
CACACTACAACATCACTGGGAACACCATCTGCCTCTTTCGCCTGGTAGACAATGAACAACTG 
AATTTAGAGGACGAAGACATTGAAAGCATTGATG CCAC CAAATTGAGCCGTTTCATTGAGAT 
CAACAGCCTCCACATGGTGACAGAGTACAACCCTGTGACTGTGATTGGGTTATTCAACAGCG 
TAATTCAGATTCATCTCCTCCTGATAATGAACAAGGCCTCCCCAGAGTATGAAGAGAACATG 
CACAGATACCAGAAGGCAGCCAAGCTCTTCCAGGGGAAGATTCTCTTTATTCTGGTGGACAG 
TGGTATGAAAGAAAATGGGAAGGTGATATCATTTTTCAAACTAAAGGAGT CT CAACTG CCAG 
CTTTGGCAATTTACCAGACTCTAGATGACGAGTGGGATACACTG CC CACAGCAG AAGTTTC C 
GTAG AGCATGTGCAAAACTTTTGTGATGGATT CCTAAGTGGAAAAT TGTTGAAAGAAAAT CG 
TG AATCAGAAGGAAAGACTCCAAAGGTGGAACTCTGACTTCTC CTTGGAACTACATATGGC C 
AAGTATCTACTTTATGCAAAGTAAAAAGGCACAACTCAAATCTCAGAGACACTAAACAACAG 
GATCACTAGGCCTGCCAACCACACACACACGCACGTGCACACACGCACGCACGCGTGCACAC 
ACACACGCGCACACACACACACACACAGAGCTTCATTTCCTGTCTTAAAATCTCGTTTTCTC 
TTCTTCCTTCTTTTAAATTTCATATCCTCACTCCCTATCCAATTTCCTTCTTATCGTGCATT 
CATACTCTGTAAGCCCATCTGTAACACACCTAGATCAAGGCTTTAAGAGACTCACTGTGATG 
CCTCTATGAAAGAGAGGCATTCCTAGAGAAAGATTGTTCCAATTTGTCATTTAATATCAAGT 
TTGTATACTGCACATGACTTACACACAACATAGTTCCTGCTCTTTTAAGGTTACCTAAGGGT 
TGAAACTCTACCTTCTTTCATAAGCACATGTCCGTCTCTGACTCAGGATCAAAAACCAAAGG 
ATGGTTTTAAACACCTTTGTGAAATTGTCTTTTTGCCAGAAGTTAAAGGCTGTCTCCAAGTC 
CCTGAACTCAGCAGAAATAGACCATGTGAAAACTCCATGCTTGGTTAGCATCTCCAACTCCC 
TATGTAAATCAACAACCTGCATAATAAATAAAAGGCAATCATGTTATA 



WO 00/12708 
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FIGURE 210 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA76401 
xsubunit 1 of 1, 273 aa, 1 stop 
><MW: .30480, pi: 4.60, NX (S/T) : 1 

MEAAPSRFMFLLFLLTCELAAEVAAEVEKSSDGPGAAQEPTWLTDVPAAMEFIAATEVAVIG 
FFQDLE I PAVPI LHSMVQKF PGVS FG I STDS E VLTH YN I TGNT I CLFRLVDNEQLNLEDED I 
ESIDATKLSRFIEINSLHMVTEYNPVTVIGLFNSVIQIHLLLIMNKASPEYEENMHRYQKAA 
KLFQGKILFILVDSGMKENGKVISFFKLKESQLPALAIYQTLDDEWDTLPTAEVSVEHVQNF 
CDGFLSGKLLKENRESEGKTPKVEL 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 143-162 
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FIGURE 211 

GGAGAGCCGCGGCTGGGACCGGAGTGGGGAGCGCGGCGTGGAGGTGCCACCCGGCGCGGGTG 
GCGGAGAGATCAGAAGCCTCTTCCCCAAGCCGAGCCAACCTCAGCGGGGACCCGGGCTCAGG 
GACGCGGCGGCGGCGGCGGCGACTGCAGTGGCTGGACGATGGCAGCGTCCGCCGGAGCCGGG 
GCGGTGATTGCAGCCCCAGACAGCCGGCGCTGGCTGTGGTCGGTGCTGGCGGCGGCGCTTGG 
GCTCTTGACAGCTGGAGTATCAGCCTTGGAAGTATATACGCCAAAAGAAATCTTCGTGGCAA 
ATGGTACACAAGGGAAGCTGACCTGCAAGTTCAAGTCTACTAGTACGACTGGCGGGTTGACC 
TCAGTCTCCTGGAGCTTCCAGCCAGAGGGGGCCGACACTACTGTGTCGTTTTTCCACTACTC 
CCAAGGGCAAGTGTACCTTGGGAATTATCCACCATTTAAAGACAGAATCAGCTGGGCTGGAG 
ACCTTGACAAGAAAGATGCATCAATCAACATAGAAAATATGCAGTTTATACACAATGGCACC 
TATATCTGTGATGTCAAAAACCCTCCTGACATCGTTGTCCAGCCTGGACACATTAGGCTCTA 
TGTCGTAGAAAAAGAGAATTTGCCTGTGTTTCCAGTTTGGGTAGTGGTGGGCATAGTTACTG 
CTGTGGTCCTAGGTCTCACTCTGCTCATCAGCATGATTCTGGCTGTCCTCTATAGAAGGAAA 
AACTCTAAACGGGATTACACTGGCTGCAGTACATCAGAGAGTTTGTCACCAGTTAAGCAGGC 
TCCTCGGAAGTCCCCCTCCGACACTGAGGGTCTTGTAAAGAGTCTGCCTTCTGGATCTCACC 
AGGGCCCAGTCATATATGCACAGTTAGACCACTCCGGCGGACATCACAGTGACAAGATTAAC 
AAGTCAGAGTCTGTGGTGTATGCGGATATCCGAAAGAATTAAGAGAATACCTAGAACATATC 
CTCAGCAAGAAACAAAACCAAACTGGACTCTCGTGCAGAAAATGTAGCCCATTACCACATGT 
AGCCTTGGAGACCCAGGCAAGGACAAGTACACGTGTACTCACAGAGGGAGAGAAAGATGTGT 
ACAAAGGATATGTATAAATATTCTATTTAGTCATCCTGATATGAGGAGCCAGTGTTGCATGA 
TGAAAAGATGGTATGATTCTACATATGTACCCATTGTCTTGCTGTTTTTGTACTTTCTTTTC 
AGGTCATTTACAATTGGGAGATTTCAGAAACATTCCTTTCACCATCATTTAGAAATGGTTTG 
CCTTAATGGAGACAATAGCAGATCCTGTAGTATTTCCAGTAGACATGGCCTTTTAATCTAAG 
GGCTTAAGACTGATTAGTCTTAGCATTTACTGTAGTTGGAGGATGGAGATGCTATGATGGAA 
GCATACCCAGGGTGGCCTTTAGCACAGTATCAGTACCATTTATTTGTCTGCCGCTTTTAAAA 
AATACCCATTGGCTATGCCACTTGAAAACAATTTGAGAAGTTTTTTTGAAGTTTTTCTCACT 
AAAATATGGGGCAATTGTTAGCCTTACATGTTGTGTAGACTTACTTTAAGTTTGCACCCTTG 
AAATGTGTCATATCAATTTCTGGATTCATAATAGCAAGATTAGCAAAGGATAAATGCCGAAG 
GTCACTTCATTCTGGACACAGTTGGATCAATACTGATTAAGTAGAAAATCCAAGCTTTGCTT 
GAGAACTTTTGTAACGTGGAGAGTAAAAAGTATCGGTTTTA 



WO 00/12708 



PCT/US99/2011I 



FIGURE 212 

x/usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA76510 
xsubunit 1 of 1, 269 aa, 1 stop 
><MW: 29082, pi: 9.02, NX(S/T): 3 

MAASAGAGAVI AAPDSRRWLWSVLAAALGLLTAGVS ALEVYTPKE I FVANGTQGKLTCKFKS 
TSTTGGLTSVSWSFQPEGADTTVSFFHYSQGQVYLGNYPPFKDRISWAGDLDKKDASINIEN 
MQ F I HNGT Y I CD VKNP PD I WQ PG H I RLYWE KENL P VF P VWVWG I VT A WLGLTLL I SM I 
LAVLYRRKNSKRDYTGCSTSESLSPVKQAPRKSPSDTEGLVKSLPSGSHQGPVIYAQLDHSG 
GHHSDKINKSESWYADIRKN 

Signal peptide: 

amino acids 1-3 7 

Transmembrane domain: 

amino acids 161-183 



WO 00/12708 
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FIGURE 213 

GCCGGCTGTGCAGAGACGCCATGTACCGGCTCCTGTCAGCAGTGACTGCCCGGGCTGCCGCC 
CCCGGGGGCTTGGCCTCAAGCTGCGGACGACGCGGGGTCCATCAGCGCGCCGGGCTGCCGCC 
TCTCGGCCACGGCTGGGTCGGGGGCCTCGGGCTGGGGCTGGGGCTGGCGCTCGGGGTGAAGC 
TGGCAGGTGGGCTGAGGGGCGCGGCCCCGGCGCAGTCCCCCGCGGCCCCCGACCCTGAGGCG 
TCGCCTCTGGCCGAGCCGCCACAGGAGCAGTCCCTCGCCCCGTGGTCTCCGCAGACCCCGGC 
GCCGCCCTGCTCCAGGTGCTTCGCCAGAGCCATCGAGAGCAGCCGCGACCTGCTGCACAGGA 
TCAAGGATGAGGTGGGCGCACCGGGCATAGTGGTTGGAGTTTCTGTAGATGGAAAAGAAGTC 
TGGTCAGAAGGTTTAGGTTATGCTGATGTTGAGAACCGTGTACCATGTAAACCAGAGACAGT 
TATGCGAATTGCTAGCATCAGCAAAAGTCTCACCATGGTTGCTCTTGCCAAATTGTGGGAAG 
CAGGGAAACTGGATCTTGATATTCCAGTACAACATTATGTTCCCGAATTCCCAGAAAAAGAA 
TATGAAGGTGAAAAGGTTTCTGTCACAACAAGATTACTGATTTCCCATTTAAGTGGAATTCG 
TCATTATGAAAAGGACATAAAAAAGGTGAAAGAAGAGAAAGCTTATAAAGCCTTGAAGATGA 
TGAAAGAGAATGTTGCATTTGAGCAAGAAAAAGAAGGCAAAAGTAATGAAAAGAATGATTTT 
ACTAAATTTAAAACAGAGCAGGAGAATGAAGCCAAATGCCGGAATTCAAAACCTGGCAAGAA 
AAAGAATGATTTTGAACAAGGCGAATTATATTTGAGAGAAAAGTTTGAAAATTCAATTGAAT 
CCCTAAGATTATTTAAAAATGATCCTTTGTTCTTCAAACCTGGTAGTCAGTTTTTGTATTCA 
ACTTTTGGCTATAC CCTACTGGCAGC CATAGTAG AGAGAG CTTCAGGATGTAAATATTTGGA 
CTATATGCAGAAAATATTCCATGACTTGGATATGCTGACGACTGTGCAGGAAGAAAACGAGC 
CAGTGATTTACAATAGAGCAAGGTAAATGAATACCTTCTGCTGTGTCTAGCTATATCGCATC 
TTAACACTATTTTATTAATTAAAAGTCAAATTTTCTTTGTTTCCATTCCAAAATCAACCTGC 
CACATTTTGGGAGCTTTTCTACATGTCTGTTTTCTCATCTGTAAAGTGAAGGAAGTAAAACA 
TGTTTATAAAGTAAAAAAA 



<Z/&/2 7£> 
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FIGURE 214 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA76522 
xsubunit 1 of 1, 373 aa, 1 stop 
><MW: 41221, pi: 8.54, NX(S/T): 0 

MYRLLS AVTARAAAPGGLAS S CGRRGVHQRAGLP PLGHGWVGGLGLGLGLALGVKLAGGLRG 
AAPAQSPAAPDPEASPLAEPPQEQSLAPWSPQTPAPPCSRCFARAIESSRDLLHRIKDEVGA 
PG I WGVSVDGKEVWSEGLGYADVENRVPCKPETVMRI AS I SKSLTMVALAKLWEAGKLDLD 
IPVQHYVPEFPEKEYEGEKVSVTTRLLISHLSGIRHYEKDIKKVKEEKAYKALKMMKENVAF 
EQEKEGKSNEKNDFTKFKTEQENEAKCRNSKPGKKKNDFEQGELYLREKFENS I ESLRLFKN 
DPLFFKPGSQFLYSTFGYTLLAAIVERASGCKYLDYMQKIFHDLDMLTTVQEENEPVIYNRAR 

Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 39-60 



WO 00/12708 



PCIYUS99/201U 



FIGURE 215 

GTGACACTATAGAAGAGCTATGACGTCGCATGCACGCGTACGTAAGCTCGGAATTCGGCTCG 
AGGCTGGTGGGAAGAAGCCGAGATGGCGGCAGCCAGCGCTGGGGCAACCCGGCTGCTCCTGC 
TCTTGCTGATGGCGGTAGCAGCGCCCAGTCGAGCCCGGGGCAGCGGCTGCCGGGCCGGGACT 
GGTGCGCGAGGGGCTGGGGCGGAAGGTCGAGAGGGCGAGGCCTGTGGCACGGTGGGGCTGCT 
GCTGGAGCACTCATTTGAGATCGATGACAGTGCCAACTTCCGGAAGCGGGGCTCACTGCTCT 
GGAACCAGCAGGATGGTACCTTGTCCCTGTCACAGCGGCAGCTCAGCGAGGAGGAGCGGGGC 
CGACTCCGGGATGTGGCAGCCCTGAATGGCCTGTACCGGGTCCGGATCCCAAGGCGACCCGG 
GGCCCTGGATGGCCTGGAAGCTGGTGGCTATGTCTCCTCCTTTGTCCCTGCGTGCTCCCTGG 
TGGAGTCGCACCTGTCGGACCAGCTGACCCTGCACGTGGATGTGGCCGGCAACGTGGTGGGC 
GTGTCGGTGGTGACGCACCCCGGGGGCTGCCGGGGCCATGAGGTGGAGGACGTGGACCTGGA 
GCTGTTCAACACCTCGGTGCAGCTGCAGCCGCCCACCACAGCCCCAGGCCCTGAGACGGCGG 
CCTTCATTGAGCGCCTGGAGATGGAACAGGCCCAGAAGGCCAAGAACCCCCAGGAGCAGAAG 
TCCTTCTTCGCCAAATACTGGATGTACATCATTCCCGTCGTCCTGTTCCTCATGATGTCAGG 
AGCGCCAGACACCGGGGGCCAGGGTGGGGGTGGGGGTGGGGGTGGTGGTGGGGGTAGTGGCC 
TTTGCTGTGTGCCACCCTCCCTGTAAGTCTATTTAAAAACATCGACGATACATTGAAATGTG 
TGAACGTTTTGAAAAGCTACAGCTTCCAGCAGCCAAAAGCAACTGTTGTTTTGGCAAGACGG 
TCCTGATGTACAAGCTTGATTGAAATTCACTGCTCACTTGATACGTTATTCAGAAACCCAAG 
GAATGGCTGTCCCCATCCTCATGTGGCTGTGTGGAGCTCAGCTGTGTTGTGTGGCAGTTTAT 
TAAACTGTCCCCCAGATCGACACGCAAAAAAAAA 



WO 00/12708 
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FIGURE 216 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA76529 
xsubunit 1 of l, 269 aa, 1 stop 
><MW: 28004, pi: 5.80, NX(S/T): 1 

MAAASAGATRLLLLLLMAVAAPSRARGSGCRAGTGARGAGAEGREGEACGTVGLLLEHSFEI 
DDSANFRKRGSLLWNQQDGTLSLSQRQLSEEERGRLRDVAALNGLYRVRIPRRPGALDGLEA 
GGYVSSFVPACSLVESHLSDQLTLHVDVAGNWGVSWTHPGGCRGHEVEDVDLELFNTSVQ 
LQPPTTAPGPETAAFIERLEMEQAQKAKNPQEQKSFFAKYWMYI I PWLFLiMMSGAPDTGGQ 
GGGGGGGGGGGSGLCCVPPSL 

Signal peptide: 

amino acids 1-24 

Transmembrane domain: 

amino acids 226-243 



J2./J/X76 



WO 00/12708 
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FIGURE 217 

GGAGCGCTGCTGGAACCCGAGCCGGAGCCGGAGCCACAGCGGGGAGGGTGGCCTGGCGGCCT 
GGAGCCGGACGTGTCCGGGGCGTCCCCGCAGACCGGGGCAGCAGGTCGTCCGGGGGCCCACC 
ATGCTGGTGACTGCCTACCTTGCTTTTGTAGGCCTCCTGGCCTCCTGCCTGGGGCTGGAACT 
GTCAAGATGCCGGGCTAAACCCCCTGGAAGGGCCTGCAGCAATCCCTCCTTCCTTCGGTTTC 
AACTGGACTTCTATCAGGTCTACTTCCTGGCCCTGGCAGCTGATTGGCTTCAGGCCCCCTAC 
CTCTATAAACTCTACCAGCATTACTACTTCCTGGAAGGTCAAATTGCCATCCTCTATGTCTG 
TGGCCTTGCCTCTACAGTCCTCTTTGGCCTAGTGGCCTCCTCCCTTGTGGATTGGCTGGGTC 
GCAAGAATTCTTGTGTCCTCTTCTCCCTGACTTACTCACTATGCTGCTTAACCAAACTCTCT 
CAAGACTACTTTGTGCTGCTAGTGGGGCGAGCACTTGGTGGGCTGTCCACAGCCCTGCTCTT 
CTCAGCCTTCGAGGCCTGGTATATCCATGAGCACGTGGAACGGCATGACTTCCCTGCTGAGT 
GGATCCCAGCTACCTTTGCTCGAGCTGCCTTCTGGAACCATGTGCTGGCTGTAGTGGCAGGT 
GTGGCAGCTGAGGCTGTAGCCAGCTGGATAGGGCTGGGGCCTGTAGCGCCCTTTGTGGCTGC 
CATCCCTCTCCTGGCTCTGGCAGGGGCCTTGGCCCTTCGAAACTGGGGGGAGAACTATGACC 
GGCAGCGTGCCTTCTCAAGGACCTGTGCTGGAGGCCTGCGCTGCCTCCTGTCGGACCGCCGC 
GTGCTGCTGCTGGGCACCATACAAGCTCTATTTGAGAGTGTCATCTTCATCTTTGTCTTCCT 
CTGGACACCTGTGCTGGACCCACACGGGGCCCCTCTGGGCATTATCTTCTCCAGCTTCATGG 
CAGCCAGCCTGCTTGGCTCTTCCCTGTACCGTATCGCCACCTCCAAGAGGTACCACCTTCAG 
CCCATGCACCTGCTGTCCCTTGCTGTGCTCATCGTCGTCTTCTCTCTCTTCATGTTGACTTT 
CTCTACCAGCCCAGGCCAGGAGAGTCCGGTGGAGTCCTTCATAGCCTTTCTACTTATTGAGT 
TGGCTTGTGGATTATACTTTCCCAGCATGAGCTTCCTACGGAGAAAGGTGATCCCTGAGACA 
GAGCAGGCTGGTGTACTCAACTGGTTCCGGGTACCTCTGCACTCACTGGCTTGCCTAGGGCT 
CCTTGTCCTCCATGACAGTGATCGAAAAACAGGCACTCGGAATATGTTCAGCATTTGCTCTG 
CTGTCATGGTGATGGCTCTGCTGGCAGTGGTGGGACTCTTCACCGTGGTAAGGCATGATGCT 
GAGCTGCGGGTACCTTCACCTACTGAGGAGCCCTATGCCCCTGAGCTGTAACCCCACTCCAG 
GACAAGATAGCTGGGACAGACTCTTGAATTCCAGCTATCCGGGATTGTACAGATCTCTCTGT 
GACTGACTTTGTGACTGTCCTGTGGTTTCTCCTGCCATTGCTTTGTGTTTGGGAGGACATGA 
TGGGGGTGATGGACTGGAAAGAAGGTGCCAAAAGTTCCCTCTGTGTTACTCCCATTTAGAAA 
ATAAAC ACTTTTAAATGAT CAAAAAAAAAAAA 



WO 00/12708 
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FIGURE 218 

MLVTAYLAFVGLIASCLGLELSRCRAKPPGRACSNPSFLRFQLDFYQVYFLALAADWLQAPY 
LYKLYQHYYFLEGQIAILYVCGLASTVLFGLVASSLVDWLGRKNSCVLFSLTYSIiCCLTKLS 
QDYFVLLVGRALGGLSTALLFSAFEAWYIHEHVERHDFPAEWIPATFARAAFWNHVLAVVAG 
VAAEAVASWIGLGPVAPFVAAIPLLALAGALALRNWGENYDRQRAFSRTCAGGLRCLLSDRR 
VLLLGTIQALFESVIFIFVFLWTPVLDPHGAPLGIIFSSFMAASLLGSSLYRIATSKRYHLQ 
PMHLLSLAVIilWFSLFMLTFSTSPGQESPVESFIAFLLIELACGIiYFPSMSFLRRKVIPET 
EQAGVLNWFR VPLHSLACLGLLVLHDSD^KTGTRNMFS I CSAVMVMALLAWGLFTWRHDA 
ELRVPSPTEEPYAPEL 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 41-55, 75-94, 127-143, 191-213, 249-270, 278-299, 
314-330, 343-359, 379-394, 410-430 
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FIGURE 219 

GCGACGCGCGGCGGGGCQGCGAGAGGAAACGCGGCGCCGGGCCGGGCCCGGCCCTGGAGATG 
GTCCCCGGCGCCGCGGGCTGGTGTTGTCTCGTGCTCTGGCTCCCCGCGTGCGTCGCGGCCCA 
CGGCTTCCGTATCCATGATTATTTGTACTTTCAAGTGCTGAGTCCTGGGGACATTCGATACA 
TCTTCACAGCCACACCTGCCAAGGACTTTGGTGGTATCTTTCACACAAGGTATGAGCAGATT 
CACCTTGTCCCCGCTGAACCTCCAGAGGCCTGCGGGGAACTCAGCAACGGTTTCTTCATCCA 
GGACCAGATTGCTCTGGTGGAGAGGGGGGGCTGCTCCTTCCTCTCCAAGACTCGGGTGGTCC 
AGGAGCACGGCGGGCGGGCGGTGATCATCTCTGACAACGCAGTTGACAATGACAGCTTCTAC 
GTGGAGATGATCCAGGACAGTACCCAGCGCACAGCTGACATCCCCGCCCTCTTCCTGCTCGG 
CCGAGACGGCTACATGATCCGCCGCTCTCTGGAACAGCATGGGCTGCCATGGGCCATCATTT 
CCATCCCAGTCAATGTCACCAGCATCCCCACCTTTGAGCTGCTGCAACCGCCCTGGACCTTC 
TGGTAGAAGAGTTTGTCCCACATTCCAGCCATAAGTGACTCTGAGCTGGGAAGGGGAAACCC 
AGGAATTTTGCTACTTGGAATTTGGAGATAGCATCTGGGGACAAGTGGAGCCAGGTAGAGGA 
AAAGGGTTTGGGCGTTGCTAGGCTGAAAGGGAAGCCACACCACTGGCCTTCCCTTCCCCAGG 
GCCCCCAAGGGTGTCTCATGCTACAAGAAGAGGCAAGAGACAGGCCCCAGGGCTTCTGGCTA 
GAACCCGAAACAAAAGGAGCTGAAGGCAGGTGGCCTGAGAGCCATCTGTGACCTGTCACACT 
CACCTGGCTCCAGCCTCCCCTACCCAGGGTCTCTGCACAGTGACCTTCACAGCAGTTGTTGG 
AGTGGTTTAAAGAGCTGGTGTTTGGGGACTCAATAAAC CCTCACTGAC TTTTTAGCAATAAA 
GCTTCTCATCAGGGTTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 220 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA76532 
xsubunit 1 of 1, 188 aa, 1 stop 
><MW: 21042, pi: 5.36, NX(S/T): 2 

MVPGAAGWCCLVLWLP AC VAAHGFRI HD YLYFQVLS PGD I RY I FTAT PAKDFGG I FHTRYEQ 
IHLVPAEPPEACGELSNGFFIQDQIALVERGGCSFLSKTRWQEHGGRAVIISDNAVDNDSF 
YVEM I QD STQRTAD I P ALFLLGRDG YM IRRS LEQHGLPWAI I S I PVNVTS I PTFELLQPPWTFW 

Signal peptide: 

amino acids 1-20 



<Z33*276 



WO 00/12708 PCT/US99/20111 

FIGURE 221 

TCTGCCTCCACTGCTCTGTGCTGGGATCATGGAACTTGCACTGCTGTGTGGGCTGGTGGTGA 
TGGCTGGTGTGATTCCAATCCAGGGCGGGATCCTGAACCTGAACAAGATGGTCAAGCAAGTG 
ACTGGGAAAATGCCCATCCTCTCCTACTGGCCCTACGGCTGTCACTGCGGACTAGGTGGCAG 
AGGCCAACCCAAAGATGCCACGGACTGGTGCTGCCAGACCCATGACTGCTGCTATGACCACC 
TGAAGACCCAGGGGTGCGG CAT CTACAAGGACAACAACAAAAGC AG CATACATTGTATGGAT 
TTATCTCAACGCTATTGTTTAATGGCTGTGTTTAATGTGATCTATCTGGAAAATGAGGACTC 
CGAAT^AAAGCTATTACTAWTTNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 222 

></usr/seqdb2/sst/DNA/Dnaseqs,min/ss .DNA76538 
xsubunit 1 of 1, 116 aa, 1 stop 
><MW: 12910, pi: 6.41, NX(S/T): 1 

MELALLCGL WMAG V I PIQGG I LNLNKMVKQVTGKMP I LS YWPYGCHCGLGGRGQPKDATDW 
CCQTHDCCYDHLKTQGCGIYKD1INKSSIHCMDLSQRYCLMAVFNVIYLENEDSE 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 1-24 

N-glycosylation site. 

amino acids 86-89 

N-myristoylation sites. 

amino acids 20-25, 45-50 

Phospholipase A2 histidine active site, 

amino acids 63-70 
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FIGURE 223 

CTCGCTTCTTCCTTCTGGATGGGGGCCCAGGGGGCCCAGGAGAGTATAAAGGCGATGTGGAG 
GGTGCCCGGCACAACCAGACGCCCAGTCACAGGCGAGAGCCCTGGGATGCACCGGCCAGAGG 
CCATGCTGCTGCTGCTCACGCTTGCCCTCCTGGGGGGCCCCACCTGGGCAGGGAAGATGTAT 
GGCCCTGGAGGAGGCAAGTATTTCAGCACCACTGAAGACTACGACCATGAAATCACAGGGCT 
GCGGGTGTCTGTAGGTCTTCTCCTGGTGAAAAGTGTCCAGGTGAAACTTGGAGACTCCTGGG 
ACGTGAAACTGGGAGCCTTAGGTGGGAATACCCAGGAAGTCACCCTGCAGCCAGGCGAATAC 
ATCACAAAAGTCTTTGTCGCCTTCCAAGCTTTCCTCCGGGGTATGGTCATGTACACCAGCAA 
GGACCGCTATTTCTATTTTGGGAAGCTTGATGGCCAGATCTCCTCTGCCTACCCCAGCCAAG 
AGGGGCAGGTGCTGGTGGGCATCTATGGCCAGTATCAACTCCTTGGCATCAAGAGCATTGGC 
TTTGAATGGAATTATCCACTAGAGGAGCCGACCACTGAGCCACCAGTTAATCTCACATACTC 
AGCAAACTCACCCGTGGGTCGCTAGGGTGGGGTATGGGGCCATCCGAGCTGAGGCCATCTGT 
GTGGTGGTGGCTGATGGTACTGGAGTAACTGAGTCGGGACGCTGAATCTGAATCCACCAATA 
AATAAAGCTTCTGCAGAAAA 
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FIGURE 224 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA76541 
xsubunit 1 of 1, 178 aa, 1 stop 
><MW: 19600, pi: 5.89, NX(S/T): 1 

MHRPEAMLLLLTLALLGGPTWAGKMYGPGGGKYFSTTEDYDHEITGLRVSVGLLLVKSVQVK 
LGDSWDVKLGALGGNTQEVTLQPGEYITKVFVAFQAFLRGMVMYTSKDRYFYFGKLDGQISS 
AYPSQEGQVLVGIYGQYQLLGIKSIGFEWNYPLEEPTTEPPVNLTYSANSPVGR 

Signal peptide: 

amino acids 1-22 



WO 00/12708 
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FTGURE 225 

GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCT 

GAACTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAG 

CCCCAAATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTT 

TTTTTTTTAACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATAT 

CCATGAAGATCCTATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATT 

TATTTGTTCTTGGAGTGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCT 

CCCAAGGGGTCCAATTTTTCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTG 

ACAGGGGCTGTCATGCAACTGGCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAA 

CAATACAAAGGATGGGTTTCAATGTAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTT 

ATAGCCCCCACTGTCTTACTGACAATGCTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTG 

TAGGTGTGAAGGCAAAATGGTATATTGTGAATCTCAGAAATTACAGGAGATACCCTCAAGTA 

TATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCTATAACAGCCTTCAAAAACTTAAGTATAAT 

CAATTTAAAGGGCTCAACCAGCTCACCTGGCTATACCTTGACCATAACCATATCAGCAATAT 

TGACGAAAATGCTTTTAATGGAATACGCAGACTCAAAGAGCTGATTCTTAGTTCCAATAGAA 

TCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATTTACGGAACTTGGATCTGTCC 

TATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTGCGGAAGCTGCTGAGTTT 

ACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCAAGACTGCCGCAACC 

TGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATGTCTTTGCTGGC 

ATGATCAGACTCAAAGAACTTCACCTGGAG CACAATCAATTTTC CAAG CTCAAC CTGGCCCT 

TTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGTCATAG 

GACAGACCATGTCCTGGACCTGGAGCTCCTTACAAAGGCTTGATTTATCAGGCAATGAGATC 

GAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGA 

TTCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACA 

TCAGTCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTG 

AAAAGTTTTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGG 

AGTAAATGTGATCGATGCAGTGAAGAACTACAGCATCTGTGGCAAAAGTACTACAGAGAGGT 

TTGATCTGGCCAGGGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCAT 

GAGAGCAAACCCCCTTTGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGC 

TGACGCCGAGCACATCTCTTTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCG 

TGCTCGTCATCCTGCTGGTTATCTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAG 

CTGCAGCAGCGCTCCCTCATGCGAAGGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAAT 

GACTCCCAGCACCCAGGAATTTTATGTAGATTATAAACCCACCAACACGGAGACCAGCGAGA 

TGCTGCTGAATGGGACGGGACCCTGCACCTATAACAAATCGGGCTCCAGGGAGTGTGAGGTA 

TGAACCATTGTGATAAAAAGAGCTCTTAAAAGCTGGGAAATAAGTGGTGCTTTATTGAACTC 

TGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCCTTCCCTCTCCCTCTCACTTTGGTGG 

CAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACTGGTCATTTTCCT CT CATACATA 

ATCAACCCATTGAAATTTAAATACCACAATCAATGTGAAGCTTGAACTCCGGTTTAATATAA 

TACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTTGTTTTAAGATAAAACT 

TCTTTCATAGGTAAAAAAAAAAA 
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FIGURE 226 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA77301 
xsubunit 1 of 1, 513 aa, 1 stop 
><MW: 58266, pi: 9.84, NX{S/T): 4 ■ 

MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAG 
CLGLSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYF 

LNNT FRP VTNLRNLDL S YNQ LHSLGS EQ FRGLRKLLS LHLRSNS LRT I PVR I FQDCRNLELL 
DLGYNRIRSLARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTM 
SWTWSSLQRLDLSGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLA 
GNIWECSRNICSLVNWLKSFKGLRENTIICASPKELQGVNVIDAVKNYSICGKSTTERFDLA 
RALPKPTFKPKLPRPKHESKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVI 
LLVIYVSWKRYPASMKQLQQRSLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLN 

GTGP CTYNKS GS RE CE V 

Important features of the protein: 
Signal peptide: 

amino acids 1-3 3 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 4 65-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 



2*9 /2 70 
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FIGURE 227 

AGTTCTGAGAAAGAAGGAAATAAACACAGGCACCAAACCACTATCCTAAGTTGACTGTCCTT 
TAAATATGTCAAGATCCAGACTTTTCAGTGTCACCTCAGCGATCTCAACGATAGGGATCTTG 
TGTTTGCCGCTATTCCAGTTGGTGCTCTCGGACCTACCATGCGAAGAAGATGAAATGTGTGT 
AAATTATAATGACCAACACCCTAATGGCTGGTATATCTGGATCCTCCTGCTGCTGGTTTTGG 
TGGCAGCTCTTCTCTGTGGAGCTGTGGTCCTCTGCCTCCAGTGCTGGCTGAGGAGACCCCGA 
ATTGATTCTCACAGGCGCACCATGGCAGTTTTTGCTGTTGGAGACTTGGACTCTATTTATGG 
GACAGAAGCAGCTGTGAGTCCAACTGTTGGAATTCACCTTCAAACTCAAACCCCTGACCTAT 
ATCCTGTTCCTGCTCCATGTTTTGGCCCTTTAGGCTCCCCACCTCCATATGAAGAAATTGTA 
AAAACAACCTGATTTTAGGTGTGGATTATCAATTTAAAGTATTAACGACATCTGTAATTCCA 
AAACATCAAATTTAGGAATAGTTATTTCAGTTGTTGGAAATGTCCAGAGATCTAtTCATATA 
GTCTGAGGAAGGACAATTCGACAAAAGAATGGATGTTGGAAAAAATTTTGGTCATGGAGATG 
TTTAAATAGTAAAGTAGCAGGCTTTTGATGTGTCACTGCTGTATCATACTTTTATGCTACAC 
AACCAAATTAATGCTTCTCCACTAGTATCCAAACAGGCAACAATTAGGTGCTGGAAGTAGTT 
TCCATCACATTTAGGACTCCACTGCAGTATACAGCACACCATTTTCTGCTTTAAACTCTTTC 
CTAGCATGGGGTCCATAAAAATTATTATAATTTAACAATAGCCCAAGCCGAGAATCCAACAT 
GTCCAGAACCAGAACCAGAAAGATAGTATTTGAATGAAGGTGAGGGGAGAGAGTAGGAAAAA 
GAAAAGTTTGGAGTTGAAGGGTAAAGGATAAATGAAGAGGAAAAGGAAAAGATTACAAGTCT 
CAGCAAAAACAAGAGGTTTTATGCCCCAACCTGAAGAGGAAGAAATTGTAGATAGAAGGTGA 
AGGAGATTGCTGAAGATATAGAGCACATATAATGCCAACACGGGGAGAAAAGAAAATTTCCC 
CTTTTACAGTAATGAATGTGGCCTCCATAGTCCATAGTGTTTCTCTGGAGCCTCAGGGCTTG 
GCATTTATTGCAGCATCATGCTAAGAACCTTCGGCATAGGTATCTGTTCCCATGAGGACTGC 
AGAAGTAGCAATGAGACATCTTCAAGTGGCATTTTGGCAGTGGCCATCAGCAGGGGGACAGA 
CAAAAACATCCATCACAGATGACATATGATCTTCAGCTGACAAATTTGTTGAACAAAACAAT 
AAACATCAATAGATATCTAAAAA 



a3Q/27o 
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FIGURE 228 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA77303 
xsubunit 1 of 1, 14 6 aa, 1 stop 
><MW: 16116, pi: 4.99, NX(S/T): 0 

MSRSRLFSVTSAISTIGILCLPLFQLVLSDLPCEEDEMCVNYNDQHPNGWYIWILLLLVLVA 
ALLCGAWLCLQCWLRRPRIDSHRRTMAVFAVGDLDSIYGTEAAVSPTVGIHLQTQTPDLYP 
VPAPCFGPLGSPPPYEEIVKTT 

Signal peptide: 

amino acids 1-2 9 

Transmembrane domain: 

amino acids 52-70 
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FIGURE 229 

GAGCGGAGTAAAATCTCCACAAGCTGGGAACAAACCTCGTCCCAACTCCCACCCACCGGCGT 
TTCTCCAGCTCGATCTGGAGGCTGCTTCGCCAGTGTGGGACGCAGCTGACGCCCGCTTATTA 
GCTCTCGCTGCGTCGCCCCGGCTCAGAAGCTCCGTGGCGGCGGCGACCGTGACGAGAAGCCC 
ACGGCCAGCTCAGTTCTCTTCTACTTTGGGAGAGAGAGAAAGTCAGATGCCCCTTTTAAACT 
CCCTCTTCAAAACTCATCTCCTGGGTGACTGAGTTAATAGAGTGGATACAACCTTGCTGAAG 
ATGAAGAATATACAATATTGAGGATATTTTTTTCTTTTTTTTTTCAAGTCTTGATTTGTGGC 
TTACCTCAAGTTACCATTTTTCAGTCAAGTCTGTTTGTTTGCTTCTTCAGAA ATG TTTTTTA 
CAATCTCAAGAAAAAATATGTCCCAGAAATTGAGTTTACTGTTGCTTGTATTTGGACTCATT 
TGGGGATTGATGTTACTGCACTATACTTTTCAACAACCAAGACATCAAAGCAGTGTCAAGTT 
ACGTGAGCAAATACTAGACTTAAGCAAAAGATATGTTAAAGCTCTAGCAGAGGAAAATAAGA 
ACACAGTGGATGTCGAGAACGGTGCTTCTATGGCAGGATATGCGGATCTGAAAAGAACAATT 
GCTGTCCTTCTGGATGACATTTTGCAACGATTGGTGAAGCTGGAGAACAAAGTTGACTATAT 
TGTTGTGAATGGCTCAGCAGCCAACACCACCAATGGTACTAGTGGGAATTTGGTGCCAGTAA 
CCACAAATAAAAGAACGAATGTCTCGGGCAGTATCAGATAGCAGTTGAAAATCACCTTGTGC 
TGCTCCATCCACTGTGGATTATATCCTATGGCAGAAAAGCTTTATAATTGCTGGCTTAGGAC 
AGAGCAATACTTTACAATAAAAGCTCTACACATTTTCAAGGAGTATGCTGGATTCATGGAAC 
TCTAATTCTGTACATAAAAATTTTAAAGTTATTTGTTTGCTTTCAGGCAAGTCTGTTCAATG 
CTGTACTATGTCCTTAAAGAGAATTTGGTAACTTGGTTGATGTGGTAAGCAGATAGGTGAGT 
TTTGTATAAATCTTTTGTGTTTGAGATCAAGCTGAAATGAAAACACTGAAAAACATGGATTC 
ATTTCTATAACACATTTATTTAAGTATATAACACGTTTTTTGGACAAGTGAAGAATGTTTAA 
TCATTCTGTCATTTGTTCTCAATAGATGTAACTGTTAGACTACGGCTATTTGAAAAAATGTG 
CTTATTGTACTATATTTTGTTATTCCAATTATGAGCAGAGAAAGGAAATATAATGTTGAAAA 
TAATGTTTTGAAATCATGACCCAAAGAATGTATTGATTTG CACTATCCTT CAGAATAACTGA 
AGGTTAATTATTGTATATTTTTAAAAATTACACTTATAAGAGTATAATCTTGAAATGGGTAG 
CAGCCACTGTCCATTACCTATCGTAAACATTGGGGCAATTTAATAACAGCATTAAAATAGTT 
GTAAACTCTAATCTTATACTTATTGAAGAATAAAAGATATTTTTATGATGAGAGTAACAATA 
AAGTATTCATGATTTTTCACATACATGAATGTTCATTTAAAAGTTTAATCCTTTGAGTGTCT 
ATGCTATCAGGAAAGCACATTATTTCCATATTTGGGTTAATTTTGCTTTTATTATATTGGTC 
TAGGAGGAAGGGACTTTGGAGAATGGAACTCTTGAGGACTTTAGCCAGGTGTATATAATAAA 
GGTACTTTTGTGCTGCATTAAATTGCTTGGAAAGTGTTAACATTATATTATATAAGAGTATC 
CTTTATGAAATTTTGAATTTGTATAACAGATGCATTAGATATTCATTTTATATAATGGCCAC 
TTAAAATAAGAACATTTAAAATATAAACTATGAAGATTGACTATCTTTTCAGGAAAAAAGCT 
GTATATAGCACAGGGAACCCTAATCTTGGGTAATTCTAGTATAAAACAAATTATACTTTTAT 
TTAAATTTCCCTTGTAGCAAATCTAATTGCCACATGGTGCCCTATATTTCATAGTATTTATT 
CTCTATAGTAACTGCTTAAGTGCAGCTAGCTTCTAGATTTAGACTATATAGAATTTAGATAT 
TGTATTGTTCGTCATTATAATATGCTACCACATGTAGCAATAATTACAATATTTTATTAAAA 
TAAATATGTGAAATATTGTTTCATGAAAGACAGATTTCCAAATCTCTCTTCTCTTCTCTGTA 
CTGTCTACCTTTATGTGAAGAAATTAATTATATGCCATTGCCAGGT 



\ 
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FIGURE 230 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA77648 
xsubunit 1 of 1, 140 aa, 1 stop 
><MW: 15668, pi: 10.14, NX(S/T): 5 

MFFTISRKNMSQKLSLLLLVFGLIWGLMLLHYTFQQPRHQSSVKLREQILDLSKRYVKAIiAE 
ENKNTVDVENGASMAGYADLKRTI AVLLDD I LQRLVKLENKVDYI WNGSAANTTNGTSGNL 
VPVTTNKRTNVSGS IR 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 



JK33/270 
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FIGURE 231 



CGCGGCCGGGCCGCCGGGGTGAGCGTGCCGAGGCGGCTGTGGCGCAGGCTTCCAGCCCCCAC 
CATGCCGTGGCCCCTGCTGCTGCTGCTGGCCGTGAGTGGGGCCCAGACAACCCGGCCATGCT 
TCCCCGGGTGCCAATGCGAGGTGGAGACCTTCGGCCTTTTCGACAGCTTCAGCCTGACTCGG 
GTGGATTGTAGCGGCCTGGGCCCCCACATCATGCCGGTGCCCATCCCTCTGGACACAGCCCA 
CTTGGACCTGTCCTCCAACCGGCTGGAGATGGTGAATGAGTCGGTGTTGGCGGGGCCGGGCT 
ACACGACGTTGGCTGGCCTGGATCTCAGCCACAACCTGCTCACCAGCATCTCACCCACTGCC 
TTCTCCCGCCTTCGCTACCTGGAGTCGCTTGACCTCAGCCACAATGGCCTGACAGCCCTGCC 
AGCCGAGAGCTTCACCAGCTCACCCCTGAGCGACGTGAACCTTAGCCACAACCAGCTCCGGG 
AGGTCTCAGTGTCTGCCTTCACGACGCACAGTCAGGGCCGGGCACTACACGTGGACCTCTCC 
CACAACCTCATTCACCGCCTCGTGCCCCACCCCACGAGGGCCGGCCTGCCTGCGCCCACCAT 
TCAGAGCCTGAACCTGGCCTGGAACCGGCTCCATGCCGTGCCCAACCTCCGAGACTTGCCCC 
TGCGCTACCTGAGCCTGGATGGGAACCCTCTAGCTGTCATTGGTCCGGGTGCCTTCGCGGGG 
CTGGGAGGCCTTACACACCTGTCTCTGGCCAGCCTGCAGAGGCTCCCTGAGCTGGCGCCCAG 
TGGCTTCCGTGAGCTACCGGGCCTGCAGGTCCTGGACCTGTCGGGCAACCCCAAGCTTAACT 
GGGCAGGAGCTGAGGTGTTTTCAGGCCTGAGCTCCCTGCAGGAGCTGGACCTTTCGGGCACC 
AACCTGGTGCCCCTGCCTGAGGCGCTGCTCCTCCACCTCCCGGCACTGCAGAGCGTCAGCGT 
GGGCCAGGATGTGCGGTGCCGGCGCCTGGTGCGGGAGGGCACCTACCCCCGGAGGCCTGGCT 
CCAGCCCCAAGGTGCCCCTGCACTGCGTAGACACCCGGGAATCTGCTGCCAGGGGCCCCACC 
ATCTTGTGACAAATGGTGTGGCCCAGGGCCACATAACAGACTGCTGTCCTGGGCTGCCTCAG 
GTCCCGAGTAACTTATGTTCAATGTGCCAACACCAGTGGGGAGCCCGCAGGCCTATGTGGCA 
GCGTCACCACAGGAGTTGTGGGCCTAGGAGAGGCTTTGGACCTGGGAGCCACACCTAGGAGC 
AAAGTCTCACCCCTTTGTCTACGTTGCTTCCCCAAACCATGAGCAGAGGGACTTCGATGCCA 
AACCAGACTCGGGTCCCCTCCTGCTTCCCTTCCCCACTTATCCCCCAAGTGCCTTCCCTCAT 
GCCTGGGCCGGCCTGACCCGCAATGGGCAGAGGGTGGGTGGGACCCCCTGCTGCAGGGCAGA 
GTTCAGGTCCACTGGGCTGAGTGTCCCCTTGGGCCCATGGCCCAGTCACTCAGGGGCGAGTT 
TCTTTTCTAACATAGCCCTTTCTTTGCCATGAGGCCATGAGGCCCGCTTCATCCTTTTCTAT 
TTCCCTAGAACCTTAATGGTAGAAGGAATTGCAAAGAATCAAGTCCACCCTTCTCATGTGAC 
AGATGGGGAAACTGAGGCCTTGAGAAGGAAAAAGGCTAATCTAAGTTCCTGCGGGCAGTGGC 
ATGACTGGAGCACAGCCTCCTGCCTCCCAGCCCGGACCCAATGCACTTTCTTGTCTCCTCTA 
ATAAGCCCCACCCTCCCCGCCTGGGCTCCCCTTGCTGCCCTTGCCTGTTCCCCATTAGCACA 
GGAGTAGCAGCAGCAGGACAGGCAAGAGCCTCACAAGTGGGACTCTGGGCCTCTGACCAGCT 
GTGCGGCATGGGCTAAGTCACTCTGCCCTTCGGAGCCTCTGGAAGCTTAGGGCACATTGGTT 
CCAGCCTAGCCAGTTTCTCACCCTGGGTTGGGGTCCCCCAGCATCCAGACTGGAAACCTACC 
CATTTTCCCCTGAGCATCCTCTAGATGCTGCCCCAAGGAGTTGCTGCAGTTCTGGAGCCTCA 
TCTGGCTGGGATCTCCAAGGGGCCTCCTGGATTCAGTCCCCACTGGCCCTGAGCACGACAGC 
CCTTCTTACCCTCCCAGGAATGCCGTGAAAGGAGACAAGGTCTGCCCGACCCATGTCTATGC 
TCTACCCCCAGGGCAGCATCTCAGCTTCCGAACCCTGGGCTGTTTCCTTAGTCTTCATTTTA 
TAAAAGTTGTTGCCTTTTTAACGGAGTGTCACTTTCAACCGGCCTCCCCTACCCCTGCTGGC 
CGGGGATGGAGACATGTCATTTGTAAAAGCAGAAAAAGGTTGCATTTGTTCACTTTTGTAAT 
ATTGTCCTGGGCCTGTGTTGGGGTGTTGGGGGAAGCTGGGCATCAGTGGCCACATGGGCATC 
AGGGGCTGGCCCCACAGAGACCCCACAGGGCAGTGAGCTCTGTCTTCCCCCACCTGCCTAGC 
CCATCATCTATCTAACCGGTCCTTGATTTAATAAACACTATAAAAGGTTTAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FTGTJRE 232 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA77652 
xsubunit 1 of 1, 3 53 aa, 1 stop 
><MW: 37847, pi: 6.80, NX(S/T): 2 

MPWPLLLLLAVSGAQTTRPCFPGCQCEVETFGLFDSFSLTRVDCSGLGPHIMPVPIPLDTAH 
LDLSSNRLEMVNESVLAGPGYTTLAGLDLSHNLLTSISPTAFSRLRYLESLDLSHNGLTALP 
AESFTSSPLSDVNLSHNQLREVSVSAFTTHSQGRALHVDLSHNLIHRLVPHPTRAGLPAPTI 
QSLNLAWNRLHAVPNLRDLPLRYLSLDGNPLAVIGPGAFAGLGGLTHLSLASLQRLPELAPS 
GFRELPGLQVLDLSGNPKLNWAGAEVFSGLSSLQELDLSGTNLVPLPEALLLHLPALQSVSV 
GQDVRCRRLVREGT YPRRPGS S PKVPLHCVDTRE SAARGPT I L 

Signal peptide : 
amino acids 1-16 

Transmembrane domains : 

amino acids 215-232, 287-304 
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FIGURE 233 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGG 
GGACCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCA 
AAACAAGTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCT 
GTTCCAGGCCTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGC 
CACGACCTGTGCCACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCAC 
GGGAGGCTTGGCAGTTTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCC 
TCTAGTCTTGCCTTCAGCCTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGG 
ACTGAAGACACTCAATTTGGGAAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATG 
GATTTTCTGAGATACGGGGCAGTGTGCAAGCCAAAGATGGAAACATTGACATCAGAATCTTA 
AGGAGGACTGAGTCTTTGCAAGACACAAAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTT 
GCTAAGACTCTATCTGGACAGGGTATTTAAAAACTACCAGACCCCTGACCATTATACTCTCC 
GGAAGATCAGCAGCCTCGCCAATTCCTTTCTTACCATCAAGAAGGACCTCCGGCTCTCTCAT 
GC CCAC ATGACATGCCATTGTGGGGAGGAAGCAATGAAGAAATACAG CCAGATTCTGAGT CA 
CTTTGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGCTTTGGGGGAACTAGACATTCTTC 
TGCAATGGATGGAGGAGACAGAATAGGAGGAAAGTGATGCTGCTGCTAAGAATATTCGAGGT 
CAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAAACCACCATCTCTTTACT 
GTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTGCTTCCTTGCATGAT 
TGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTTTGTAATATCTT 
TCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTAATGTATTT 
ATTTTTTTACTTGGACATGAAACTTTAAAAAAATTCACAGATTATATTTATAACCTGACTAG 
AGCAGGTGATGTATTTTTATACAGTAAAAAAAAAAAACCTTGTAAATTCTAGAAGAGTGGCT 
AGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGA 
TATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATT 
GCACATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTAT 
CTTCCAGCCAGGAATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATA 
CCAAAAAAAAAAAAAAAAAAA 
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FIGURE 234 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA83500 
xsubunit 1 of 1, 261 aa, 1 stop 
><MW: 29667, pi: 8.76, NX(S/T): 0 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRP 
EIFSSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNL 
QE I RNGFS E I RGSVQAKDGN I D I R I LRRTE S LQDTK PANRCCLLRHLLRLYLDRVFKNYQT P 
DHYTLRKISSLANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAWKALG 
ELDILLQWMEETE 

Important features of the protein: 
Signal peptide: 

amino acids 1-4 2 

cAMP- and cGMP- dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 



^3-7 /z? a 
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FIGURE 235 

CCGTTATCGTCTTGCGCTACTGCTG AATG TCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCG 
CTGACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGC 
CGAGCTAGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAG 
CTCTTGCTCGGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACA 
GCCCTAGGGATCATTGAAGAGGAAGGCTTTCTAAAGCTTTGGCAAGGAGTGACACCCGCCAT 
TTACAGACACGTAGTGTATTCTGGAGGTCGAATGGTCACATATGAAGATCTCCGAGAGGTTG 
TGTTTGGCAAAAGTGAAGATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATG 
GCTGGTGTTATTGGCCAGTTTTTAGCCAATCCAACTGACCTAGTGAAGGTTCAGATGCAAAT 
GGAAGGAAAAAGGAAACTGGAAGGAAAACCATTGCGATTTCGTGGTGTACATCATGCATTTG 
CAAAAATCTTAGCTGAAGGAGGAATACGAGGGCTTTGGGCAGGCTGGGTACCCAATATACAA 
AGAGCAGCACTGGTGAATATGGGAGATTTAACCACTTATGATACAGTGAAACACTACTTGGT 
ATTGAATACAC C ACTTGAGGACAATATC ATGACTCACGGTTTATCAAGTTTATGTT CTGGAC 
TGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCATCAAAAGCAGAATAATGAATCAACCA 
CGAGATAAACAAGGAAGGGGACTTTTGTATAAATCATCGACTGACTGCTTGATTCAGGCTGT 
TCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACCATCTTGGCTGAGAATGACCC 
CTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGATGAGTGGAGTCAGTCCA 
TTT TAA 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA77568 * 
xsubunit 1 of 1, 323 aa, 1 stop 
><MW: 36064, pi: 9.33, NX(S/T): 1 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGA 
RESAPYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEH 
YPLWKSVIGGMMAGVIGQFLANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKILAEGGI 
RGLWAGWVPNIQRAALVNMGDLTTYDfVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTP 
AD V I KSR I MNQPRDKQGRGLLYKS STDCL I Q AVQGEGFMS LYKGFLP SWLRMTPWSMVFWLT 
YEKIREMSGVSPF 

Transmembrane domains : 

amino acids 25-38, 130-147, 233-248 
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FIGURE 237 

CGGACGCGTGGGCGCGGGACGCCGGCAGGGTTGTGGCGCAGCAGTCTCCTTCCTGCGCGCGC 
GCCTGAAGTCGGCGTGGGCGTTTGAGGAAGCTGGGATACAGCATTTAATGAAAAATTTATGC 
TTAAGAAGTAAAA ATG GCAGGCTTCCTAGATAATTTTCGTTGGCCAGAATGTGAATGTATTG 
ACTGGAGTGAGAGAAGAAATGCTGTGGCATCTGTTGTCGCAGGTATATTGTTTTTTACAGGC 
TGGTGGATAATGATTGATGCAGCTGTGGTGTATCCTAAGCCAGAACAGTTGAACCATGCCTT 
TCACACATGTGGTGTATTTTCCACATTGGCTTTCTTCATGATAAATGCTGTATCCAATGCTC 
AGGTGAGAGGTGATAGCTATGAAAGCGGCTGTTTAGGAAGAACAGGTGCTCGAGTTTGGCTT 
TTCATTGGTTTCATGTTGATGTTTGGGTCACTTATTGCTTCCATGTGGATTCTTTTTGGTGC 
ATATGTTACCCAAAATACTGATGTTTATCCGGGACTAGCTGTGTTTTTTCAAAATGCACTTA 
TATTTTTTAGCACTCTGATCTACAAATTTGGAAGAACCGAAGAGCTATGGACCTGAGATCAC 
TTCTTAAGTCACATTTTCCTTTTGTTATATTCTGTTTGTAGATAGGTTTTTTATCTCTCAGT 
ACACATTGCCAAATGGAGTAGATTGTACATTAAATGTTTTGTTTCTTTACATTTTTATGTTC 
TGAGTTTTGAAATAGTTTTATGAAATTTCTTTATTTTTCATTGCATAGACTGTTAATATGTA 
TATAATACAAGACTATATGAATTGGATAATGAGTATCAGTTTTTTATTCCTGAGATTTAGAA 
CTTGATCTACTCCCTGAGCCAGGGTTACATCATCTTGTCATTTTAGAAGTAACCACTCTTGT 
CTCTCTGGCTGGGCACGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGG 
CCGATTGCTTGAGGTCAAGTGTTTGAGACCAGCCTGGCCAACATGGCGAAACCCCATCTACT 
AAAAATACAAAAATTAGCCAGGCATGGTGGTGGGTGCCTGTAATCCCAGCTACCTGGGAGGC 
TGAGGCAGGAGAATCGCTTGAACCCGGGGGGCAGAGGTTGCAGTGAGCTGAGTTTGCGCCAC 
TGCACTCTAGCCTGGGGGAGAAAGTGAAACTCCCTCTCAAAAAAAAGACCACTCTCAGTATC 
TCTGATTTCTGAAGATGTACAAAAAAATATAGCTTCATATATCTGGAATGAGCACTGAGCCA 
TAAAAGGTTTTCAGCAAGTTGTAACTTATTTTGGCCTAAAAATGAGGTTTTTTTGGTAAAGA 
AAAAATATTTGTTCTTATGTATTGAAGAAGTGTACTTTTATATAATGATTTTTTAAATGCCC 
AAAGGACTAGTTTGAAAGCTTCTTTTAAAAAGAATTCCTCTAATATGACTTTATGTGAGAA 
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FIGURE 238 

MAGFLDNFRWPECECIDWSERRNAVASWAGILFFTGWWIMIDAAWYPKPEQLNHAFHTCG 
VFSTLAFFMINAVSNAQVRGDSYESGCLGRTGARVWLFIGFMLMFGSLIASMWILFGAYVTQ 
NTDVYPGLAVF FQNAL I FFSTL I YKFGRTEELWT 

Important features : 
Signal peptide: 

amino acids 1-44 

Transmembrane domains : 

amino acids 23-42 (type II), 60-80, 97-117, 128-148 
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FIGURE 239 

GTTGATGGCAAACTTCCTCAAAGGAGGGGCAGAGCCTGCGCAGGGCAGGAGCAGCTGGCCCA 

CTGGCGGCCCGCAACACTCCGTCTCACCCTCTGGGCCCACTGCATCTAGAGGAGGGCCGTCT 

GTGAGGCCACTACCCCTCCAGCAACTGGGAGGTGGGACTGTCAGAAGCTGGCCCAGGGTGGT 

GGTCAGCTGGGTCAGGGACCTACGGCACCTGCTGGACCACCTCGCCTTCTCCATCGAAGCAG 

GGAAGTGGGAGCCTCGAGCCCTCGGGTGGAAGCTGACCCCAAGCCACCCTTCACCTGGACAG 

GATGAGAGTGTCAGGTGTGCTTCGCCTCCTGGCCCTCATCTTTGCCATAGTCACGACATGGA 

TGTTTATTCGAAGCTACATGAGCTTCAGCATGAAAACCATCCGTCTGCCACGCTGGCTGGCA 

GCCTCGCCCACCAAGGAGATCCAGGTTAAAAAGTACAAGTGTGGCCTCATCAAGCCCTGCCC 

AGCCAACTACTTTGCGTTTAAAATCTGCAGTGGGGCCGCCAACGTCGTGGGCCCTACTATGT 

GCTTTGAAGACCGCATGATCATGAGTCCTGTGAAAAACAATGTGGGCAGAGGCCTAAACATC 

GCCCTGGTGAATGGAACCACGGGAGCTGTGCTGGGACAGAAGGCATTTGACATGTACTCTGG 

AGATGTTATGCACCTAGTGAAATTCCTTAAAGAAATTCCGGGGGGTGCACTGGTGCTGGTGG 

CCTCCTACGACGATCCAGGGACCAAAATGAACGATGAAAGCAGGAAACTCTTCTCTGACTTG 

GGGAGTTCCTACGCAAAACAACTGGGCTTCCGGGACAGCTGGGTCTTCATAGGAGCCAAAGA 

CCTCAGGGGTAAAAGCCCCTTTGAGCAGTTCTTAAAGAACAGCCCAGACACAAACAAATACG 

AGGGATGGCCAGAGCTGCTGGAGATGGAGGGCTGCATGCCCCCGAAGCCATTTTAGGGTGGC 

TGTGGCTCTTCCTCAGCCAGGGGCCTGAAGAAGCTCCTGCCTGACTTAGGAGTCAGAGCCCG 

GCAGGGGCTGAGGAGGAGGAGCAGGGGGTGCTGCGTGGAAGGTGCTGCAGGTCCTTGCACGC 

TGTGTCGCGCCTCTCCTCCTCGGAAACAGAACCCTCCCACAGCACATCCTACCCGGAAGACC 

AGCCTCAGAGGGTCCTTCTGGAACCAGCTGTCTGTGGAGAGAATGGGGTGCTTTCGTCAGGG 

ACTGCTGACGGCTGGTCCTGAGGAAGGACAAACTGCCCAGACTTGAGCCCAATTAAATTTTA 

TTTTTGCTGGTTTTGAAAAAAAAAAAAAAAAAAAA 
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</usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA59814 
<subunit 1 of 1, 224 aa, 1 stop 
<MW: 24963, pi: 9.64, NX(S/T): 1 

MRVSGVLRLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEIQVKKYKCGLIKPCP 
ANYFAFKI CSGAANWGPTMCFEDRM I MS P VKNNVGRGLN I ALVNGTTGAVLGQKAFDMYSG 
DVMHLVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFRDSWVFIGAKD 
LRGKSPFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF 

Important features: 
Signal peptide: 

amino acids 1-15 

ATP/ GTP -binding site motif A (P-loop) . 
amino acids 184-191 

N-glycosylation site. 

amino acids 107-110 



^3/27 o 
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FTGIJRE 241 

GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGG CAGCAAGAGATTTGTC CTGGGG AT 

CCAGAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACA 

GCAAGAGAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTC 

TCACTCCTCCCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCC 

CTGCACCCCTTCCTGGGACACT ATG TTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCC 

TGGCTGCAGATGGGGGTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCA 

GCCTCTTACCCTGAGTGTGGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGT 

GACATTTGACCCTGATTTGCCTGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGC 

CTTTGGACCTGCACAACAATGGCCACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTG 

GGTGGACTTCCCCGAAAATATGTAGCTGCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATC 

CCCAGGGGGGTCAGAACACCAGATCAACAGTGAAGCCACATTTGCAGAGCTCCAGATTGTAC 

ATTATGACTCTGATTCCTATGACAGCTTGAGTGAGGCTGCTGAGAGGCCTCAGGGCCTGGCT 

GTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAGAATATAGCTTATGAACACATTCTGAG 

TCACTTGCATGAAGTCAGGCATAAAGATCAGAAGACCTCAGTGCCTCCCTTCAACCTAAGAG 

AGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATGGCTCGCTCACAACTCCCCCT 

TGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAGATTTCAATGGAACAGCT 

GGAAAAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAAGCTTCTGGTACAGA 

ACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCATCCAAGCAGGA 

TCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGCTGTCTCTG 

CCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAACCGAA 

AGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGCATAAATTCCTTCTCAGATACCA 

TGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTG 

GCCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAG 

GAATGGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTA 

GGAGGAAATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGG 

GAAGTTTGGGATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATA 

TACTGCGGGATCTC TC CTTAGGATAAAGAGTTGCTGTTGAAGTTGTATATTTTTGATCAATA 

TATTTGGAAATTAAAGTTTCTGACTTT 
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FIGURE 242 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA62812 
xsubunit 1 of 1, 337 aa, 1 stop 
><MW: 37668, pi- 6.27, NX(S/T): 1 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLP 
ALQPHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQ 
INSEATFAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRH 
KDQKTSVPPFNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTL 
FSTEEEPSKLLVQNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYF 
I ARK I RKKRLENRKS WFTS AQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 243 

AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTC 
GTGGACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGATTCTACTGTTTTGTCTTCTAGG 
ATCAACTCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTC 
CGGATCAGGGAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCCTTCTTTAAGTCTG 
ATACCATTAACACAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGG 
AATGACACCTGGTACCCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAAC 
TGCACCCACATGTGTTACCAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGC 
TCAGAGGAATTGCCACAAATCTTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCAT 
CCTGCCCACCAGTCAGGCAGGGGCTAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAG 
GAGCAGGTGTAAATCCTGCCACCCAGGGAACCCCAGCAGGCCGCCTCCCAACTCeCAGTGGC 
ACAGATGACGACTTTGCAGTGACCACCCCTGCAGGCATCCAAAGGAGCACACATGCCATCGA 
GGAAGCCACCACAGAATCAGCAAATGGAATTCAGTAAGCTGTTTCAAATTTTTTCAACTAAG 
CTGCCTCGAATTTGGTGATACATGTGAATCTTTATCATTGATTATATTATGGAATAGATTGA 
GACACATTGGATAGTCTTAGAAGAAATTAATTCTTAATTTACCTGAAAATATTCTTGAAATT 
TCAGAAAATATGTTCTATGTAGAGAATCCCAACTTTTAAAAACAATAATTCAATGGATAAAT 
CTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGCATATTAAAACATATTTGGAAA 
ACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAA 
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FIGURE 244 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 

Signal peptide: 
amino acids 1-16 
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GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCG 
GAGCCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCG 
GCAGCCGGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCC 
TGCTGCTCCTGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAG 
CAAAAGGCGCAGCTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGG 
GCCAGCAGGAGTGCCTGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTG 
GGATCCCAGGTCGGGATGGATTCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAG 
GAGTCCTGGACACCCAACTACAAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCT 
TGGGAAAATTGCGGAGTGTACATTTACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGT 
TCAGTGGCTCACTTCGGCTAAAATGCAGAAATGCATGCTGTCAGCGTTGGTATTTCACATTC 
AATGGAGCTGAATGTTCAGGACCTCTTCCCATTGAAGCTATAATTTATTTGGACCAAGGAAG 
CCCTGAAATGAATTCAACAATTAATATTCATCGCACTTCTTCTGTGGAAGGACTTTGTGAAG 
GAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGGTTGGCACTTGTTCAGATTACCCAAAA 
GGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATCATTATTGAAGAACTACCAAAATA 
AATGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCCTTGGAATGGTTCACTTAAAT 
GACATT TTAAATAAGTTTATGTATACATCTGAATGAAAAG CAAAGCTAAATATGTTTACAGA 
CCAAAGTGTGATTTCACACTGTTTTTAAATCTAGCATTATTCATTTTGCTTCAATCAAAAGT 
GGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATTCTCTCAACCTA 
TAATTTGGAATATTGTTGTGGT CTTTTGTTTTTTCT CTTAGT AT AG CATTTTTAAAAAAATA 
TAAAAGCTACCAATCTTTGTACAATTTGTAAATGTTAAGAATTTTTTTTATATCTGTTAAAT 
AAAAATT ATTTC C AACA 
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FIGURE 246 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA76393 
xsubunit 1 of 1, 243 aa, 1 stop 
><MW: 26266, pi: 8.43, NX(S/T): 1 

MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGV 
PGRDGSPGANVI PGTPGI PGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIA 
ECTFTKMRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMN 
STINIHRTSSVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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PRO 



XXXXXXXXXXXXXXX (Length 
XXXXXYYYYYYY (Length 



15 amino acids) 



Comparison Protein 



12 amino acids) 



% amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide 
sequences as determined by ALIGN-2) divided by (the total number of amino acid residues of 
the PRO polypeptide) = 

5 divided by 15 = 33.3% 
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FIGURE 247B 



PRO 



XXXXXXXXXX (Length 
XXXXXYYYYYYZZYZ (Length 



10 amino acids) 
15 amino acids) 



Comparison Protein 



% amino acid sequence identity = 

(the number of identically matching amino acid residues between the two polypeptide 
sequences as determined by AUGN-2) divided by (the total number of amino acid residues of 
the PRO polypeptide) = 

5 divided by 10 = 50% 
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PRO-DNA NNNNNNNNNNNNNN (Length = 14 nucleotides) 

Comparison DNA NNNNNNLLLLLLLLLL (Length = 16 nucleotides) 

% nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as 
determined by ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic 
acid sequence) = 

6 divided by 14 = 42.9% 
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FIGURE 247D 



PRO-DNA 



NNNNNNNNNNNN 



(Length -12 nucleotides) 

(Length = 9 nucleotides) 



Comparison DNA 



NNNNLLLVV 



% nucleic acid sequence identity = 

(the number of identically matching nucleotides between the two nucleic acid sequences as 
determined by ALIGN-2) divided by (the total number of nucleotides of the PRO-DNA nucleic 
acid sequence) = 

4 divided by 12 = 33.3% 
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/* 

* C-C increased from 12 to 15 

* Z is average of EQ 

* B is average of ND 

* match with stop is M; stop-stop = 0; J (joker) match = 0 
*/ 

#define _M -8 /* value of a match with a stop */ 
int _day[26][26] = { 

/* ABCDEFGHIJKLMNOPQRSTUVWXYZ*/ 
/*A*/ {2,0,-2,0,0,-4, 1,-1,-1, 0,-1.-2,-1, 0,_M, 1.0,-2, 1, 1,0.0,-6,0,-3,0}, 
/* B */ { 0. 3,-4, 3, 2,-5, 0, 1.-2, 0, 0.-3.-2, 2, M ( -l, 1, 0. 0. 0, 0,-2.-5, 0,-3, 1}, 
/* C */ {-2,-4,15,-5,-5,-4,-3,-3,-2, 0,-5,-6,-5.-47 M,-3,-5,-4. 0,-2, 0,-2,-8. 0. 0,-5}, 
/* D */ { 0, 3,-5, 4, 3.-6, 1, 1,-2, 0, 0,^,-3, 2,_M,-1. 2,-1, 0, 0, 0,-2,-7, 0,-4, 2}, 
/* E */ { 0. 2,-5, 3, 4,-5, 0, 1.-2, 0, 0,-3,-2, 1. M.-l, 2,-1. 0, 0, 0,-2,-7, 0,-4, 3}. 
/* F */ {-4,-5.^.-6,-5, 9,-5,-2, 1, 0,-5, 2, O.^Cm.-S.-S, A-3,-3, 0,-1. 0, 0, 7,-5}. 
/*G*/ { 1,0,-3, 1,0,-5, 5,-2,-3, 0.-2,-4.-3. o/m,-1,-1,-3, 1,0,0,-1,-7,0,-5,0}, 
/* H */ {-1, 1.-3. 1, 1,-2,-2, 6,-2, 0, 0.-2.-2. 2,~M. 0, 3. 2,-1,-1, 0,-2,-3, 0. 0, 2}, 
/* I */ {-1,-2,-2,-2,-2, 1,-3,-2, 5, 0,-2, 2. 2,-2,_M,-2,-2,-2,-l, 0. 0, 4,-5, 0,-1,-2}, 
/* J */ { 0, 0. 0, 0. 0. 0, 0. 0. 0, 0, 0, 0, 0. 0,_M. 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}, 
/* K */ {-1, 0,-5. 0, 0.-5.-2, 0,-2, 0, 5,-3, 0, 1, M.-l, 1. 3, 0, 0. 0.-2.-3. 0,-4, 0}. 
/* L */ {-2,-3,-6,-4.-3, 2.-4.-2, 2, 0,-3, 6, 4,-3~M.-3,-2,-3,-3,-l, 0, 2,-2, 0,-1,-2}, 
/* M */ {-1,-2,-5,-3,-2. 0.-3,-2, 2, 0, 0. 4, 6,-2, M.-2.-1, 0,-2,-1, 0. 2,-4. 0,-2.-1}, 
/* N */ { 0, 2,-4, 2, 1,-4, 0, 2,-2, 0, 1.-3,-2, 2,~M,-1, 1, 0, 1, 0, 0,-2,^. 0,-2, 1}, 

/* O */ {_M,_M,_M,_M,_M._M,_M ,_M, M, M,_M,_M,_M._M, 0,_M._M,_M,_M._M,_M,_M,_M,_M,_M,_M}. 

/* P */ { 1.-1.-3,-1,-1,-5,-1, 0,-2, 0,-1,-3,-2,-f. M, 6, 0, 0, 1, 0. 0,-1,-6, 0,-5, 0}. 

/* Q */ { 0, 1,-5. 2, 2,-5,-1. 3,-2, 0, 1,-2,-1, 1,~M, 0, 4. 1,-1,-1, 0.-2,-5, 0,-4. 3}, 

/* R */ {-2, 0.-4,-1.-1,-4,-3, 2,-2. 0, 3,-3, 0, 0, _ M, 0, 1, 6, 0,-1, 0,-2, 2, 0,-4, 0}, 

/*S*/ { 1,0,0,0,0.-3, 1,-1,-1,0,0,-3.-2, 1,~M, 1,-1,0,2, 1,0.-1,-2,0,-3.0}, 

/* T */ { 1, 0,-2, 0, 0,-3, 0,-1, 0, 0, 0,-1,-1, 0,~M, 0,-1,-1, 1, 3, 0, 0,-5, 0.-3, 0}, 

/* U */ { 0, 0. 0, 0, 0, 0, 0. 0, 0, 0, 0. 0, 0. 0 JM, 0. 0. 0, 0, 0. 0, 0, 0, 0, 0, 0}, 

/* V */ { 0,-2,-2,-2,-2,-1.-1,-2, 4, 0,-2, 2, 2.-2, M,-l.-2.-2,-l, 0, 0, 4,-6, 0,-2,-2}, 

/* W */ {-6,-5,-8,-7,-7, 0,-7.-3,-5, 0,-3,-2,-4,-4,~M,-6,-5, 2,-2,-5, 0,-6,17. 0, 0,-6}. 

/* X */ { 0, 0. 0, 0, 0, 0, 0, 0, 0, 0, 0. 0, 0. 0, M, 0. 0, 0, 0, 0. 0, 0. 0, 0, 0, 0}, 

/* Y */ {-3,-3, 0,-4,-4, 7,-5, 0,-1, 0.-4,-1,-2,-27 M,-5,-4,-4,-3,-3, 0,-2, 0, 0.10,-4}, 

/* Z */ { 0, 1,-5. 2, 3.-5. 0, 2,-2, 0, 0.-2,-1, 1 Jm, 0, 3. 0, 0. 0, 0,-2,-6, 0,-4, 4} 

}; 
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^include <stdio.h> 




^include <ctypc.h> 




# define 


MAXJMP 


16 


^define 


MAX GAP 


24 


^define 


JMPS 


1024 


#define 


MX 


4 


#defuie 


DMAT 


3 


^define 


DMIS 


0 


Idefine 


DINSO 


8 


^define 


DINS1 


1 


^define 


PINSO 


8 


^define 


PINS I 


4 


struct jmp { 





/* max jumps in a diag */ 

/* don't continue to penalize gaps larger than this */ 
/* max jmps in an path */ 

/* save if there's at least MX- 1 bases since last jmp */ 

/* value of matching bases */ 

/* penalty for mismatched bases */ 

/* penalty for a gap */ 

/* penalty per base */ 

/* penalty for a gap */ 

/* penalty per residue */ 



short n [MAXJMP]; /* size of jmp (neg for dely) */ 

unsigned short x [MAXJMP]; /* base no. of jmp in seq x */ 

/* limits seq to 2*16-1 */ 



struct diag { 
int 
long 
short 
struct jmp 

}; 



score; 
offset; 
ijmp; 

jp; 



/* score at last jmp */ 
/* offset of prev block */ 
/* current jmp index */ 
/* list of jmps */ 



struct path { 

int spc; /+ number of leading spaces */ 

short n[JMPS];/* size of jmp (gap) */ 

int x[JMPS];/* loc of jmp (last elem before gap) */ 



char 




♦ofile; 


/* output file name */ 


char 




*namex[2]; 


/* seq names: getseqsO */ 


char 




*prog; 


/* prog name for err msgs */ 


char 




*seqx[2]; 


/* seqs: getseqsO */ 


int 




dmax; 


/* best diag: nwfj */ 


int 




dmaxO; 


/* final diag */ 


int 




dna; 


/* set if dna: mainO */ 


int 




endgaps; 


/* set if penalizing end gaps */ 


int 




gapx, gapy; 


/* total gaps in seqs */ 


int 




lenO, leal; 


/* seq lens */ 


int 




ngapx, ngapy; 


/* total size of gaps */ 


int 




smax; 


/* max score: nw() */ 


int 




*xbm; 


/* bitmap for matching */ 


long 




offset; 


/* current offset in jmp file */ 


struct 


diag 


*dx; 


/* holds diagonals */ 


struct 


path 


PP[2]; 


/* holds path for seqs */ 


char 




*calloc0, *mallocO. * 


indexO, *strcpy0; 


char 




*getseq0. *g_calloc0; 
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/* Needleman-Wunsch alignment program 
* 

* usage: progs filel file2 

* where filel and file2 are two dna or two proiein sequences. 

* The sequences can be in upper- or lower-case an may contain ambiguity 

* Any lines beginning with ';','> ' or * < ' are ignored 

* Max file length is 65535 (limited by unsigned short x in the jmp struct) 

* A sequence with 1/3 or more of its elements ACGTU is assumed to be DNA 

* Output is in the file "align.out" 
* 

* The program may create a tmp file in /tmp to hold info about traceback. 

* Original version developed under BSD 4.3 on a vax 8650 
*/ 

^include "nw.h" 
^include "day.h" 



static dbval[26] = { 

T.14,2,13,0,0,4, 11.0,0,12,0,3.15,0,0.0,5,6^8.7.9.0,10,0 

}; 

static _pbval[26] = { 

1, 2|(1< <( , D , -'A'))|(1< <( , N , - , A')) t 4, 8, 16. 32, 64, 
128, 256, OxFFFFFFF. 1< < 10, 1<<11, 1<<12, 1<<13, 1<<14, 
I<<15, 1<<16, 1<<17. 1<<18, 1<<19, 1< <20, l<<21, 1< <22. 
1 < <23, 1 < <24, 1 < <25|(1 < <('E*-'A*))|(1 < KCQ'-'A')) 



main(ac, av) 

int ac; 
char *avQ; 

{ 

prog = av[0]; 
if(ac!= 3) { 

fprintf(stderr." usage: %s filel file2\n\ prog); 

fjjrimf^tderr," where filel and file2 are two dna or two protein sequencesAn"); 
f>rintf(stderr,"The sequences can be in upper- or lower-case\n"); 
fj)rintf(stderr."Any lines beginning with *;' or ' < ' are ignored\n"); 
fJ>rintf(stderr."Output is in the file \"align.out\"\n"); 
extt(l); 

} 

namex[0] = av[l]; 

ramex[ll ~ av[2]; 

seqx[0] = getseq(namex[0], &ten0); 

seqxfl] = getseq(namex[l], &lenl); 

xbm = (dna)? .dbvai : _pbval; 

endgaps - 0; /* 1 to penalize endgaps */ 

ofile = "align.out"; /* output file */ 

nwO; /* fill in the matrix, get the possible jmps */ 

readjmpsO; /* get the actual jmps */ 

printO; /* print stats, alignment */ 

cleanup(0); /* unlink any imp files */ 



main 
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/* do the alignment, return best score: mainO 

* dna: values in Fitch and Smith, PNAS. 80, 1382-1386, 1983 

* pro: PAM 250 values 

* When scores are equal, we prefer mismatches to any gap, prefer 

* a new gap to extending an ongoing gap, and prefer a gap in seqx 

* to a gap in seq y. 
*/ 

nw() 
{ 



char 


*px, *py; 


/* seqs and ptrs */ 


int 


♦ndely, *dely; 


/* keep track of dely */ 


int 


ndetx, delx: 


/* keep track of delx */ 


bit 


*tmp; 


/* for swapping rowO, rowl */ 


int 


mis; 


/* score for each type */ 


int 


insO, ins I; 


/* insertion penalties */ 


register 


id: 


/* diagonal index */ 


register 


u; 


/* jmp index */ 


register 


*col0, *coll; 


/* score for curr, last row */ 


register 


xx, yy; 


/* index into seqs */ 



dx = (struct diag *)g_caIloc("lo get diags", lenO+lenl +1, sizeof(struct diag)); 

ndely = (int *)g_calloc("to get ndely", lenl + 1, sizeof(int)); 
dely - (int *)g_calloc("to get dely", lenl + 1, sizeof(int)); 
colO = (int *)g_calloc("to get colO", lenl + 1, sizeof(int)); 
coll = (int *)g_calloc("to get coll", lenl + 1, sizeof(int)); 
insO = (dna)? DINSO : PINSO; 
insl = (dna)? DINS1 : PINS I; 

smax = -10000; 
if (endgaps) { 

for (col0[01 = dely[0] = -insO, yy = I; yy < = lenl; yy++) { 
colO[yy] = dely[yy] = colO[yy-l] - insl; 
ndely [yy] - yy; 

} 

co!0[0] = 0; /* Waterman Bull Math Biol 84 */ 

> 

else 

for (yy ~ 1; yy < = lenl; yy++) 
dely[yy] = -insO; 

/* fill in match matrix 
*/ 

for (px = seqx[0]. xx = 1; xx < = lenO; px+ + , xx++) { 
/* initialize first entry in col 
*/ 

IF (endgaps) { 

if (xx « I) 

coll[0] = delx = -(insO + insl); 

else 

coll[0] = delx = colO[0] - insl; 
ndetx = xx ; 

} 

else { 

coll[0] = 0; 
delx - -insO; 
ndelx = 0; 

} 
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...nw 

for (py = seqx[l], yy = 1; yy < - lenl; py+ + , yy + +) { 
mis « co!0[yy-l]; 
if(dna) 

mis += (xbmi*px~'A']&.xbml*py-'A'W DMAT : DMIS; 

else 

mis + = _day(*px-'A , ][*py- , A'J; 

/* update penalty for del in x seq; 

* favor new del over ongong del 

* ignore MAXGAP if weighting endgaps 
*/ 

if (endgaps [| ndely[yy] < MAXGAP) { 

if (co\0{yy] - ihsO > = delytyyj) { 

delytyyj = col0[yy] - (insO + insl); 
ndely[yy] = 1; 

}else{ 

delytyyj -= insl; 
ndely[yy] + + ; 

} 

}else{ 

if (col0[yyl - (insO+insl) > = dely[yy]) { 
delytyyj - colOfyyJ - (insO+insl); 
ndely[yyj = 1; 

}else 

ndely[yy] + + ; 

} 

/* update penalty for del in y seq; 

* favor new del over ongong del 
*/ 

if (endgaps j | ndeix < MAXGAP) { 

if (coll{yy-l] - insO > = delx) { 

delx - colilyy-11 - (insO+insl); 
ndelx = I; 

}else{ 

delx -= insl; 
ndelx+ + ; 

} 

} else { 

if (colltyy-1] - (insO+insl) > = delx) { 
delx = co!I[yy-lJ - (insO+insl); 
ndelx = 1; 

}else 

ndelx+ + ; 

} 

/* pick the maximum score; we're favoring 

* mis over any del and delx over dely 
*/ 
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...nw 

id = xx- yy + tenl - 1; 

if (mis > = delx && mis > = detylyy]) 

coll[yy] = mis; 
else if (delx > - dely[yy]) { 

coll[yy] = delx; 

ij = dx[id].ijmp; 

if (dx[id]jp.n[0] && (!dna | j (ndelx > = MAXJMP 
&& xx > dxfid] jp.x[ij]+MX) 1 1 mis > dx[id].score+DINSO)) { 
dx[id].ijmp+ + ; 
if( + +ij >= MAXJMP) { 
writejmps(id); 
ij = dx(idj.ijmp = 0; 
dx[id] .offset = offset; 

offset += sfceof (struct jmp) + sizeof(offset); 

} 

} 

dxfid] jp.n[ij] = ndelx; 
dx[id].jp.x[ij] = xx; 
dx[id]. score = delx; 

> 

else { 

colltyy] = dely[yy]; 
ij = dx[id],ijmp; 

if (dx[id].jp.n[0] && (!dna 1 1 (ndely[yy] > = MAXJMP 

&& xx > dx[id].jp.x[ij]+MX) | [ mis > dx [id], score +DINS0)) { 
dx[id].ijmp+ + ; 
if(++ij > = MAXJMP) { 
writejmps(id); 
ij = dx[id].ijmp = 0; 
dx[id].offset = offset; 

offset += sizeof (struct jmp) + sizeof(offset); 

} 

} 

dx[id].jp.n[ijj = -ndelytyy]; 
dxfid] .jp.x[ij] = xx; 
dx[id] .score = dely[yy]; 

} 

if (xx = = lenO && yy < Ienl) { 
/* last col 
*/ 

if (endgaps) 

collfyy] -= insO-f insl*(lenl-yy); 
if (colllyy] > smax) { 

smax = coll[yy]; 

dmax = id; 

} 

} 

} 

if (endgaps &&. xx < lenO) 

coll[yy-l] -= ins0+insl*(len0-xx); 
if (coll[yy-l) > smax) { 

smax = coll[yy-I]; 

dmax = id; 

} 

tmp = colO; coIO = coll; coll = tmp; 

(void) free((char *)ndely); 
(void) free<(char *)dely); 
(void) free((char *)co!0); 
(void) free((char *)coll); 
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/* 

* primO — only routine visible outside this module 
* 

* static: 

* getmatf) - trace back best path, count matches: printO 

* pr_alignO ~ print alignment of described in array pQ: prini() 

* dumpblockO - dump a block of lines with numbers, stars: pr_align() 

* numsO — put out a number line: dumpblockO 

* putlineO - put out a line (name, {num], seq, Inum]): dumpblockO 

* starsO - -put a line of stars: dumpblockO 

* stripnameO -- strip any path and prefix from a seqname 
*/ 



^include "nw.h" 



#define SPC 3 

#define P LINE 256 /* maximum output line */ 

#define P_SPC 3 /* space between name or num and seq */ 

extern _day[26][26]; 

int olen: /* set output line length */ 

FILE *fx; /* output file */ 

printO P rint 
{ 

int Ix, ly, firstgap, lastgap; /* overlap */ 

if «fx = fopen(ofile, *w")) 0) { 

fprintf(stderr, ,, %s: can't write %s\n", prog, oftle); 
cleanup(l); 

} 

tprimf(fx, • < first sequence: %s (length - %d)\n", namex(0], lenO): 
r>rintf(fx, "< second sequence: %s (length - %d)\n", namex[l], lenl); 
olen = 60; 
Ix = lenO; 
ly = lenl; 

firstgap = lastgap - 0; 

if (dmax < lenl - 1) { /* leading gap in x */ 
pp[0].spc = firstgap = lenl - dmax - 1; 
ly-= ppJOJ.spc; 

} 

else if (dmax > lenl - 1) { /* leading gap in y */ 
pp[l].spc = firstgap — dmax - (lenl - I); 
ix-= pp[l].spc; 

} 

if (dmaxO < lenO - 1) { /* trailing gap in x */ 
lastgap = lenO - dmaxO -1; 
lx -~ lastgap; 

} 

else if (dmaxO > lenO - 1) { /* trailing gap in y */ 
lastgap - dmaxO - (lenO - 1); 
ly -= lastgap; 

getmat(lx, ly, firstgap, lastgap); 
pr_alignQ; 
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* trace back the best path, count matches 
*/ 

static 

getmat(lx, ly, firstgap, lastgap) 

int lx, ly; /* "core" (minus endgaps) */ 

int firstgap, lastgap; /* leading trailing overlap */ 



{ 



int nm, i0.il, sizO, sizl; 

char outx[32]; 

double pet; 

register nO.nl; 

register char *p0, *pl; 

/* get total matches, score 
*/ 

i0 = il *= sizO = sizl = 0; 
pO = seqx[0] + pp[l].spc; 
pi - seqx[l] + pp(0].spc; 
nO - pp[l].spc + I; 
nl = pp(0].spc + 1; 

nm = 0; 

while (*pO&& *pl){ 
If (sizO) { 

pl + + ; 
nl + + ; 
sizO-; 

} 

else if (sizl) { 

p0+ + ; 



nO+ + ; 
sizl-; 



> 

else { 



if (xbm[*pO-' A' ]&xbm[*p 1 -' A" }) 

nm+ + ; 
if(nO + + «=pp[0].x[iO]> 

sizO = pp[0].n[iO+ +]; 
lf(nl + + ==pp[l]-x[il]) 

sizl = pp[l].n[il + +l; 

p0+ + ; 
pl + + ; 

} 

} 

/* pet homology: 

* if penalizing endgaps, base is the shorter seq 

* else, knock off overhangs and take shorter core 
*/ 

if (endgaps) 

Ix = (lenO < lenl)? lenO : lent; 

else 

lx = (Ix < ly)? Ix : ly; 
pet = lOO.*(double)nm/(double)lx; 
fprintf(fx, "Vn"); 

fprtntf(fx, " < %d match %s in an overlap of %d: %.2f percent similarity\n 
nm, (nm == t)? "" : "es\ lx, pet); 
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fprintf(fx, " <gaps in first sequence: %d", gapx); 
if(gapx){ 

(void) sprimf(outx t " (%d %s%s)\ 

ngapx, (dna)? "base": "residue", (ngapx == 1)? 

fprimf(fx,"%s\ outx); 



getmat 



V); 



fprintf(fx, ", gaps in second sequence: %d", gapy); 
if (gapy){ 

(void) sprintf(outx, " (%d %s%s)\ 

ngapy. (dna)? "base": "residue", (ngapy = = I)? "":V): 

fprintf(fx,"%s\ outx); 

} 

if (dna) 

fprintf(fx, 

"\n< score: %d (match = %d, mismatch - %d, gap penalty = %d + %d per base)\n\ 
smax, DM AT, DMIS, DINSO, DINS1); 

else 

fprintf(fx, 

*\n< score: %d (Day hoff PAM 250 matrix, gap penalty = %d + %d per residue)\n" , * 
smax, PINSO. PINS I); 
if (endgaps) 

fprintf(fx, 

" <endgaps penalized, left endgap: %d %s%s, right endgap: %d %s%s\n", 
firstgap, (dna)? "base" : "residue", (firstgap ~~ 1)? "" : "s\ 
lastgap, (dna)? "base" : "residue", (lastgap == 1)? "" : "s"); 



else 



fprintf(fx, "< endgaps not penalized\n"); 



static nm; 

static Imax; 

static ij[2]; 

static nc[2]; 

static ni[2]; 

static siz[21; 

static char *ps[2]; 

static char *po[2]; 

static char out[2][P_LINE); /* output line */ 

static char star[P_UNE]; /* set by starsQ */ 



/* matches in core - for checking */ 
/* lengths of stripped file names */ 
/* jmp index for a path */ 
/* number at start of current line */ 
/* current elem number — for gapping */ 

/* ptr to current element */ 
/* ptr to next output char slot */ 



/* 

* print alignment of described in struct path ppn 
*/ 

static 
pr alignO 
{ 



int 
int 

register 



nn; 

more; 

i; 



/* char count */ 



pralign 



for (i = 0, lmax = 0; i < 2; i+ +) { 
nn = stripname(namex[i]); 
if (nn > Imax) 

lmax = nn: 



nc[i] = 1; 
ni[i] « 1; 
siz[i) = ij[i] = 0; 
ps[i] = seqx[i]; 
po[i] « out[i]; 
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for (nn = nm = 0, more = 1; more; ) { ...pr_align 
for (i = more » 0; i < 2; i+ +) { 
/* 

* do we have more of this sequence? 
♦/ 

if(!*ps[i]) 

continue; 



more+ + ; 

if (pp[']-spc) { /* leading space */ 
*po[i] + + = 1 *; 
pp[i].spc-; 

} 

else if (siz[ij) { /* in a gap */ 
*po[i] + + = 
siz[i]--; 

} 

else { /* we're putting a seq element 

*/ 

*po[iJ = *ps[i]; 
if (islower(*ps[i])) 

*ps[i] = toupper(*ps[i]); 

po[i) + + ; 
ps[i] + + ; 

/* 

* are we at next gap for this seq? 
*/ 

if(n,[i] P p[i] x(ij[i]]> { 
/* 

* we need to merge alt gaps 

* at this location 
*/ 

siz[i] = pp[i].n[ij[i] + +l; 
while (ni[i] == pp[i].x[ij[i]]) 

sizfi] +- pp[i].n[ij[i] + +]; 

} 

ni[i] + + ; 

} 

} 

if (+ +nn = = olen 1 1 !more && nn) { 
dumpblockO; 
for(i - 0; i < 2; i++) 
po[i] = out[i]; 

nn = 0; 

} 

} 

} 

/* 

* dump a block of lines, including numbers, stars: pr alignO 
*/ 

static 

dumpblockO dumpblock 

register i; 

for (i = 0; i < 2; i++) 
*po[i]- « '\0'; 
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(void) putc(*\n\ fx); 

for (i = 0; i < 2; i + +) { 

if <*out[i] && (*out[i] != " i I *(poW) !=")){ 
if(i==0) 

nums(i); 
if(i = = 0&& *out[l]) 
starsO; 

putiine(i); 

if(i = =0&& *out{l]) 
fprimf(fx, star); 

if(t= = 1) 

nums(i); 

} 

} 

} 

/* 

* put out a number line: dumpblockO 
*/ 

static 
nums(ix) 

int ix; /* index in out[] holding seq line */ 

{ 

char nIine[P_UNE]; 

register i, j; 

register char *pn, *px, *py; 

for (pn = nline, i = 0; i < lmax+P_SPC; i+ + , pn + +) 
*pn = ' 

for (i - nc[ix], py = out[ix); *py; py+ + , pn++) { 

if(*PY == " II *py 

*pn = ' '; 



else { 



if (i%10 == 0 || (i == 1 && nc[ix] != 1)) { 
j - (i < 0)? -i : i; 
for(px = pn; j; j 10, px-) 
*px = j%10 + '0'; 

if(i < 0) 

*px = V; 

} 

else 

*pn = ' '; 



} 

} 

*pn = *\0'; 
nclix] = i; 

for (pn = nline; *pn; pn++) 
(void) putc(*pn, fx); 
(void) putc('\n\ fx); 



* put out a line (name, [num], seq, [num]): dumpblockO 
*/ 

static 
putline(ix) 

int ix; 

{ 



f X7t> 
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...putline 

fnt i; 
register char *px; 

for(px - namex[ix], i = 0; *px && *px != ':'; px+ + , 

(void) putc(*px, fx); 
for(; i < Imax+PjSPC; i++) 

(void) putc(' \ fx); 

/* these count from 1: 

* niO is current element (from 1) 

* ncQ is number at start of current line 
*/ 

for(px - outfix]; *px; px-t-+) 

(void) putc(*px&0x7F, fx); 
(void) puic('\n\ fx); 

} 



/* 

* put a line of stars (seqs always in out[0], out[l]): dumpblockO 
*/ 

static 

starsO stars 

{ 

int i; 

register char *p0, *pl t cx, *px; 

if (!*out[01 1 1 (*out(0] = = " && *(po[0]) " * ') U 
!*out[l] 1 1 (*out[ll = *(po[l]) == ")) 

return: 
px = star; 

for(i = lmax+P_SPC; i; i~) 
*px++ = ' *; 

for(pO = out[0], pi = out[l]; *p0&& *pl; p0+ + , pl + +) { 
if (isalpha(*pO) && isalpha(*pl)) { 

if (xbm[*pO- , A']&xbm[*pl-'A']) { 
cx = '*'; 
nm+ + ; 

} 

elseif(!dna&&_day[*pO-*A , ][*pl- , A , l > 0) 

cx =* V; 

else 

cx = ' '; 

} 

else 

cx = ' ': 
*px+ + - cx; 

} 

*px++ = '\n'; 
*px = '\0'; 

} 
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/* 

* strip path or prefix from pn, return len: pr alignO 
*/ 

static 

stripname(pn) Stripname 
char *pn; /* file name (may be path) */ 

{ 

register char *px, *py; 
py = 0; 

for (px = pn; *px; px++) 
if ( * px ==•/•) 

py = px + I; 

if (py) 

(void) strcpy(pn, py); 
return(strlen(pn)); 
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/* 

* cleanupO - cleanup any tmp file 

* getseqO - read in seq. set dna, len, maxlen 

* g_calloc() - callocO with error checkin 

* readjmpsO — get the go°d jmps, from tmp file if necessary 

* writejmpsO ~ w r* tt a W ied array of jmps to a imp file: nwO 
*/ 

^include "nw.h" 
^include <sys/file.h> 



char 
FILE 

int 
long 



*jname = 

cleanupO; 
IseekO; 



Vtmp/homgXXXXXX' 



/* tmp file for jmps */ 
/* cleanup tmp file */ 



* remove any tmp file if we blow 
*/ 



cleanup(i) 
{ 



int 
exit(i); 



(void) unlink(jname); 



cleanup 



/* 

* read, return ptr to seq, set dna, len, maxlen 

* skip lines starting with ';','<', or 1 > ' 

* seq in upper or lower case 
*/ 

char * 

getseq(ftle, len) 

char *file; /* file name */ 
int *len; /* seq len */ 



{ 



char 

register char 

int 

FILE 



line[1024], *pseq; 
*px, *py; 
natgc, tlen; 

•fp; 



getseq 



if ((fp - fopen(file.V)) == 0) { 

fprintf(stderr,"%s: can't read %s\n\ prog, file); 
exit(l); 

} 

tlen = natgc - 0; 

while (fgets(line, 1024, fp)) { 

if(*line « V || *line == '<* || *line == '>') 

continue; 
for (px = line; *px !« *\n*; px + +) 

if (isupper(*px) 1 1 islower(*px)) 
tlen+ + ; 

} 

if ((pseq = malloc((unsigned){tlen+6))) -= 0) { 

fprintf(stderr,"%s: mallocO failed to get %d bytes for %s\n", prog, tlen+6, file); 
exit(l); 

} 

pseq[0] « pseqll] = pseq[2] = pseq[3] = '\0'; 
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...getseq 

py = pseq + 4; 
*len - tlen; 
rewind(fp); 

while (fgets(line. 1024, fp)) { 

if(*line=- *;* 1 1 *line — '<■ || *line == *>*) 
continue; 

for (px = line; *px !=* *\n'; px++) { 
if (isupper(*px)) 

*py++ = *px; 
else if (islower(*px)) 

*py+ + « toupper(*px); 
if {indexCATGCU\*(py-l))) 

natgc+ + ; 

} 

} 

*py++ = 'UT; 

*py = '\0'; 

(void) fclose(fp); 

dna = natgc > (tlen/3); 

return(pseq+4); 

} 

char * 

g_calloc(msg, nx, sz) g_Call0C 
char *msg; /* program, calling routine */ 

int nx, sz; /* number and size of elements */ 

{ 

char *px, *ca!locO; 

if ((px = calIoc((unsigned)nx, (unsigned)sz)) = = 0) { 
if(*msg){ 

f£rintf(scderr, "%s: g_calloc0 failed %s (n=*%d. sz=%d)\n\ prog, msg, nx, sz); 
exit(l); 

} 

} 

return(px); 

} 
/* 

* get final jmps from dxQ or tmp file, set ppQ, reset dmax: mainO 
*/ 

readjmpsO readjmps 

{ 

int fd = -l; 

int siz. K), il; 

register i,j, xx; 

if(Q>{ 

(void) fclose(fj); 

if «fd = open(iname, O^RDONLY, 0)) < 0) { 

ftiriittf(stden\ "%s: can't openO %sW, prog, jname); 
cleanup(l); 

} 

> 

for (i = iO = il = 0, dmaxO = dmax, xx = lenO; ; { 
while (I) { 

for (j = dx[dmax].ijmp; j > - 0 && dx[dmaxl.jp.x[j] > = xx; j~) 
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FIGURE 248P 

...readjmps 

if (j < 0 && dx[dmax].offset && fj) { 

(void) lseek(fd, dx[dmax]. offset. 0); 

(void) readffd, (char *)&dx [dmaxj.jp, sizeof(struct jmp)); 

(void) read(fd, (char *)&dx[dmax] .offset, si2eof(dx{dmaxJ.offset)); 

dx[dmax].ijmp « MAXJMP-1; 

} 

else 

break; 

} 

if (i > = JMPS) { 

tprintf(stderr, "%s: too many gaps in alignment^" , prog); 
cleanup(l); 

} 

if G >-0){ 

siz = dx[dmax].jp.n[j]; 
xx = dx[dmax].jp.x[j]; 
dmax + = siz; 

if (siz < 0) { /* gap in second seq */ 

pp[l].n[ilj = -siz; 
xx + = siz; 

/* id ~ xx - yy + lenl - 1 
*/ 

pp[l].x[il] - xx - dmax + lenl - 1; 
gapy+ + ; 
ngapy -= siz; 
/* ignore MAXGAP when doing endgaps */ 

siz = (-siz < MAXGAP 1 1 endgaps)? -siz : MAXGAP; 
il ++; 

} 

else if (siz > 0) { /* gap in first seq */ 
pp[0].n[i0] = siz; 
pp[0].x[i0J = xx; 
gapx++; 
ngapx + = siz; 
/* ignore MAXGAP when doing endgaps */ 

siz « (siz < MAXGAP 1 1 endgaps)? siz : MAXGAP; 
K)+ + ; 

} 

} 

else 

break; 

> 

/* reverse the order of jmps 
*/ 

for (j = 0, K)-; j < tO; j + + , iO-) { 

i = pp[0].n[j]; ppfOJ.nlj] = pp[0].n[i0J; pp[0].n[i0] = i; 
i = PP[0].x[j]; pp[0].x[j] « pp[0].x[K)]; pp[0].x[i0] = i; 

} 

for (j = 0, il — ; j < il;j + + . il~) { 

i = pp[l].n[j]; PP[l].nOJ - pp[l].n[il]; pp[l].n[il] = i; 
i = PPtU-xUl: PPlU.xU] = pp[l].x[il]; pp[l].x[il] = i; 

} 

if <fd > = 0) 

(void) close(fd); 

if(fj){ 

(void) unlink(jname); 
fj =0; 
offset - 0; 

} 



WO 00/12708 

j 



POYUS99/2011I 



FIGURE 248Q 

/* 

* write a filled jmp struct offset of the prev one (if any): nwO 
*/ 

writejmps(ix) witejmps 
int ix; 

{ 

char *mktemp0; 
if(!fj){ 

if (mktemp(jname) < 0) { 

rprintf(stderr, "%s: can't mktempO %s\n". progjname); 
cleanup(l); 

} 

if ((fj = fopen(jname, "w")) «= 0) { 

r>rintf(stderr, can't write %s\n", progjname); 
exit(l); 

} 

} 

(void) fwrite((char *)&dx(ixjjp, sizeof(struct jmp). 1, fj); 
(void) fwrite((char *)&dx[ix]. offset, sizeof(dx[ix] .offset), 1, fj); 

} 
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1. Claims: 1-27 all partially 

An isolated nucleic acid having at least 80% homology to a 
nucleotide sequence that encodes a polypeptide comprising an 
amino acid sequence shown in figure 2 SEQ ID 4; said 
nucleotide sequence comprising a nucleotide sequence shown 
in Figure 1 SEQ ID 3; a vector comprising said nucleic acid; 
a host cell comprising said vector; a process for producing 
said protein using said host; an isolated PRO polypeptide 
having at least 80% identity to an amino acid sequence that 
encodes a polypeptide conprising an amino acid sequence 
shown in figure 2 SEQ ID 4 designated by PR01560 
polypeptide; a chimeric molecule comprising said polypeptide 
fused to a heterologous amino acid sequence; an antibody 
against said polypeptide; an isolated extracellular domain 
of this PR01560 polypeptide or an isolated polypeptide 
having at least 80% amino acid identity with said domain; an 
isolated PR01560 polypeptide lacking its associated signal 
peptide or an isolated polypeptide having at least 80% amino 
acid identity with said fragment; an isolated nucleic acid 
encoding one of said polypeptides. 
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Inventions 2-123: Idem as subject 1 but -limited to each of 
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